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PREFACE 


Historically, array transform Processors have been largely 
integer-arithmetic devices, since the slower processing rate 

of floating-point arithmetic was undesirable when working 

with large arrays. of data. However, integer methods have 
problems which make programming awkward due to the limited 
dynamic range of integer arithmetic. Array scaling and “block" | 
floating-point techniques have either allowed human and other 
errors to creep into the results, or were costly and time 
consuming. Further, as processing became more sophisticated, 
even 16-bit integer data words were insufficiently precise 

for preserving the accuracy of simple 8-bit analog-to-digital 
converted input data. This is because the many multiplications 
and additions in typical cascaded array processing can cause 
the propagation of truncation errors*, eee * 


“With the advent of faster digital logic, many users realized 
that floating-point processing makes programming easier, virt- 
ually eliminates dynamic range problems, greatly alleviates 
the precision problem, and that it is potentially as fast as 
the last generation of integer processors. Floating Point 
Systems, Inc. recognized this trend in 1970, and was formed 

to specialize in floating-point peripheral processors. . 


The rush to floating-point processing was not a smooth one. 

Many floating-point formats Sprang up, and Floating Point 
Systems became expert in format converting "on the fly", so 
processing time would not be lost during a format conversion. 
Why convert formats? Simple. Not all formats are mathematically 
Clean, For example, it is unwise to use a hexadecimal-—exponent 
format for serious number crunching because a hexadecimal normal- 
ization can cause as many as 3 leading zeros between the binary © 
point of the mantissa and the first Significant bit. This 

means as many as 3 least-significant bits may be lost, due 

to right-shifting the mantissa past the available word length 
(truncation), when an extreme hexadecimal normalization occurs 
(about 25% of the time), and of Course, 2y.-Of 1, OF Ho. bits 

may be lost (with equal probability) for other possible hex- 
adecimal cases, | . ‘ 


*A 16-bit integer multiplied by a 16-bit integer results in 

a 32-bit product. If the result is truncated to the 16 most- 
Significant bits, then half the time the resultant's least- 
Significant bit (lsb) is wrong since it Should have been 
rounded up. Now the product of two of these potentially wrong 
lsb numbers results in the next lsb being wrong part of the 
time; thus cascaded operations propagate the errors leftward 


> 


toward the more significant bits. 


The FPS solution is to use a true 10- bit binary exponent, 
which has more dynamic range than the standard 7-bit hex— 
adecimal or 8-bit binary exponent. FPS then uses a 28-bit 
Mantissa, plus 3 guard bits, which provides enough bits to 
not only allow for hexadecimal in/out formats, but also to 
carry enough information to permit post- ~normalization and con- 
vergent- -rounding after each arithmetic operation. Thus FPS 
can receive any reasonable floating-point format that is de- 
Sired as the input format, convert it on-the-fly to the FPS 
format, process it in FPS. format with minimal truncation 
error propagation, and then convert on-the-fly to the desired 
output format. This procedure allows transparent "no pena- 
lty" operation on the data, ‘thus’ preserving the integrity 

-of the input data. 


In addition to the Hore chosen floating-point format, the 
AP-120B has a general-purpose, non-bus oriented architecture 
for the arithmetic units. This allows great flexibility in 
that operands and resultants can be moved simultaneously from 
any register in the AP-120B to any other. This rather general- 
ized structure of the AP-120B allows it to execute special- 
ized algorithms, such as the FFT, in times comparable to 

those achieved by hardwired special- —-purpose processors, but 


it also makes the AP- ‘1208 well suited to less highly organ- 
ized computations. 


In the matter of software, it should be noted that this mach- 
ine is a synchronous monolithic multiprocessor, as opposed to 
an asynchronous multiprocessor. The practical significance of 
this is that programming by the user and/or FPS (Standard 
Algorithms, System and Test Software) is tremendously simpli- 
fied, due to the predictability of data flow and timing con- 
Siderations. There is no need for internal hand-shaking be- 
tween arithmetic units, memories, and microprocessor; data 
and results are available at precisely determined times. The 
synchronous approach not only allows a non-stocastic simulator 
to be written for easy program debugging, but in addition pro- 
grams may be single-stepped in the real processor, with exe- 
cution identical to free-running programs. A further bonus 
of the synchronous design is the easy produceability, main- 
tainability, interchangeability, and reliability (there is no 
need to explore an infinite number of possible timing con- 
ditions as one clock phases by another, aS happens in an 
asynchronous machine). Convenient and rapid data-dependent 
‘branching, simple overlapping of data input, arithmetic pro- 
cesSing, and data output are further examples of the care 
taken to assure a fast, accurate, convenient and reliable array 
processor. | 
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SECTION 1 _ 
GENERAL IN” )RMATION > 


Ll INTRODUCTION 


The AP-120B is a high-speed (167-ns cycle time) per- 
ipheral floating-point arithmetic Array Processor, which’ 
is intended to work in parallel with a host computer. 

Its internal organization is particularly well suited 
to performing the large numbers of reiterative multiplications 
and additions required in digital signal processing, matrix 
arithmetic, statistical analysis, and numerical simulation. 

The highly parallel structure of the AP-120B allows 
the "overhead" of array indexing, loop counting, and data 
fetching from memory to be performed simultaneously with 
arithmetic operations on the data. This allows much faster 
execution than on a typical general-purpose computer, where 
each of the above operations must occur sequentially. 

The AP-120B achieves its high speed through the use of 
fast commercial integrated circuit elements and an archi- 
tecture that permits each logical unit of the machine to 
operate independently and at maximum speed. 


Specifically: 


1) Programs, constants, and data each reside in separate, 
independent memories, to eliminate: memory accessing 
conflicts. 

2) Independent floating-point multiply and adder units 
allow both arithmetic operations to be initiated every 

— 167 ns. 
3) Two large (32 locations each) blocks of floating- point 
- aceumulators are available for temporary storage of 
intermediate results from the multiplier, adder, or 
from memory. 

4) Address indexing and counting functions are performed 
by an independent integer arithmetic unit that includes 
16 integer accumulators. 


In a typical application, such as a Fast Fourier Trans- 
form, the above features allow nearly the entire computation 
to be overlapped with data memory access time. 

Effective processing precision is enhanced by 38-bits 
of internal data width, an internal floating-point format 
with optimum numerical pECperELeS, and a convergent rounding 


. eee tua 


1.2 SYSTEM OVERVIEW 


A general block diagram of AP-120B arithmetic paths 
appears in Figure 1.1. 

Connection is made to the host in a manner that permits 
data transfers to occur under control of either the Host 
Computer or the AP-120B. For most host computers, this will 
mean that the AP-1290B is interfaced to both the programmed 
I/O and DMA channels. 


Figure 1.1 General AP-120B Block Diagram 
HOST | 
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The system elements are interconnected with multiple 
parallel paths so that transfers can occur in parallel. All 
internal floating-point data paths are 38-bits in width (10- 
bit biased binary exponent and :-bit 2's complement mantissa). 

Data Memory (MD) is organized in 8K-word modules of 38- 
bit. words each, expandable up to 64K words in the main chassis. 
The effective memory cycle time (interleaved) is 333 ns. 

Table Memory (TM) is used for storage of constants (FFT 
constants), and is tied to a separate data path so as not to 
interfere with Data Memory. It is bipolar, 167 ns read-only 
memory, and is organized in 512-word, 38-bit increments. 

Data Pad X (DPX) and Data Pad Y (DPY) are two blocks of 

32 floating accumulators each. Each is a two-part register 
- block, ‘wherein one register may be read and another written 
fiom each block in one instruction cycle. 

The Floating Adder (FA) consists of two input registers 
(Al and A2) anda two- stage pipe-line which performs tne operations, 
and convergently rounds the normalized result. 
| The gloating Multiplier (FM) consists of input registers 
(M1 & M2) and a three- stage pipe-line which performs the multiply 
operation. Products are normalized and convergently rounded 
38-bit numbers. 
| The S-PAD consists of 16 integer registers and an integer 
arithmetic unit which is used to form operand addresses and to 
perform integer arithmetic. 

Section 2 contains a more detailed description of each of 
the functional elements, and Section 3 describes programming 
considerations. 

Section 4 describes in detail the host computer interface, 
which Floating Point Systems supplies. A number of off-the- 
Shelf interfaces are available. - = 


1.3 EXAMPLE AP-120B APPLICATION 
A simple FFT processing sequence would go as follows: 


Initial conditions are that the FFT program is resident 
in Program Source Memory internal to the AP-120B, the array 
to be transformed is resident in host memory, and the host CPU 
has initiated the AP-~120B processor with an I/O instruction. 


1. The AP-120B requests host DMA cycles to transfer the array 
from host memory to internal data memory. Data is con- 
verted from host floating-point format to internal AP-120B 
floating-point format "on the fly". 

oe The FFT algorithm is performed, with data remaining in 
internal AP-120B format. This yields the benefit of 38- 
bit precision and convergent rounding dur ing the critical 
phases of processing. 

3% The frequency domain array is transferred back to host 

memory by requesting host DMA cycles. Data. is converted from 

internal format to host. format "on the fly. 

The AP-120B proceeds to another process or stops executing 

depending on previously established conditions. An interrupt 

to the host can be issued. , : 


nS 
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The AP-120B is most efficiently «sed when a sequence of oper- 
ations is performed on one or more sets of data which reside in 
internal data memory. This reduces data-transfer overhead, and 
retains maximum numerical precision. For example, a reasonable 
‘Sequence would be to transfer a trace and a filter, FFT both,: 
array multiply, inverse FFT, and transfer the result back to host 
memory. — oa | 

| The AP-120B Data Memory has DMA capability. That is to say 
that MD cycles can be stolen from the AP-120B microprocessor by 
the interface. This capability allows Host Computer DMA to AP- 
120B DMA data transfers to occur, thereby minimizing both host 
CPU and AP-120B overhead. : 

~The AP-120B has been designed with enough flexibility built- 

in so that its power can be harnessed in a variety of ways. Sub- 
sequent sections describe its use in detail. 


1.4 PHYSICAL DESCRIPTION 


1.4.1 General. The AP-120B is available in rack configura- 

tion. Mounting is as a stan/ard 19-inch EIA rack-mounted 

unit, requiring 22-3/4 vertical inches of space. The unit 

is ‘equipped with rack slides, permitting easy access to the 
etched and/or wire-wrapped circuitry, chassis mounted on the forwar: 
portion of the unit. The Power Panel is mounted at the rear. 
1-3/4" of space should be available above and below the 

22-3/4" of the Processor. This is for proper intake and 

exhaust of air thru the Processor. The Control Panel 

(see 1.4.4), and/or blank panels, may be used for proper. 

spacing, if the customer's equipment mounted above and below 

the Processor do not have the proper free-air space Ruilt 

into them. Intake air should be between lO-C and 40°C. 


Lav rorwand Unit. The forward unit contains all AP-120B 
circuitry except the power supply. There is provision for up 
to 28 15- by 10-inch etched-circuit boards (ECB). The ECBs 
plug into a mother board. The ECBs are arranged in a vertical 

. plane (chimney style) with push-pull fans to assure adequate 
upwards air circulation even in the event of a fan failure. 

The I/O cable exits at the bottom rear (the exact configura- 
tion is computer dependent). This unit is called the Processor. 


1.4.3 Rear Unit. The Power Supply consists of three assemolies. 
The first is the main +5-volt supply, and is capable of LOO- 
amperes output. The two other smaller supplies are -5, and 

+12 volts. The power supplies have forced convection cooling. 
All supplies are rear mounted, along with the line box (con- 
taining line filters and ‘contactor, on the Power Panel). 


1.4.4 Power, Controls and Indicators. The AP-120B is ex- 
pected to be normally powered up and down with the rest of a 
system. The AP-120B switch and indicators are on a Control 
Panel. There is a single power cord (U.S. standard 3-wire 
with ground) which must be connected to 105 to 125 volts, 

50 to 60 hertz). The service should be rated for 20 amperes 
(or 10 amperes in the case of the higher ranges) in order 

to provide a low impedance source (power required is approx- 
imately 1200 valt-amps). The Control Panel may. be mounted above 
or below either the Processor or the Power Panel. Availa- 
bility of line power is indicated by a neon ''Line Voltage" 
indicator. If the "On Off" switch is On, then the power 
supplies should come on. There are two opération indicators; 
one shows Array Processor action, and the other shows DMA 
transfers. The three individual power supplies have separate 
indicators (electroluminescent diodes). There are no external 
adjustments. The internal adjustments are the three power- 
supply setting. potentiometers on the Power Panel. 


1.4.5 Serial Numbers. The Processor has a serial-number tag 

on its starboard side, near the top; Dorward, ending ‘in. "A" The 
Power Panel tag, ending in "B", is located inside near the top. 
The Control Panel has its tag, ending in "C', also "inside". 
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WIDTH: 18 (40.26 Cri) 
DEPTH: SO - 257 (SB.e3 - 63.5 Cri). 
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AIR SPack ELLOW THe AP- 12GB 
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Figure A ier’ AP-120B Physical Configuration 


1.5 SOFTWARE 


- Four packages of software are supplied with the AP-120B 
which assist the user toward the solution of his particular 
processing task, 


1.5.1 APEX (A.P.. Executive). APEX is a mechanism for commu- 
~nicating with the AP-120B via a series of FORTRAN or machine 
language subroutine "calls". The executive driver routine 
interprets the particular user call and directs the AP-120B 
to perform the specified action. For example, in Fortran, to 
load an array A containing N real data points into the AP-120B, 
- and Reorore a real Fast Fourier Transform upon that data: 


TA=0 


CALL APPUT (A, IA,N, 2) 
CALI: eREED CEA Nod) 


Both the Standard Applications Subroutines described below 
and user developed AP-120B programs may be called from the host 
computer using APEX, 


1.5.2 APMATH (A.P. Math Gipeeee These are 70 sub- 

routines written in AP-120B assembly language. They are callable 
from the host computer Fortran or machine language using BEA: 
They are listed in Tacle 1-2, 


Leovo Program Development Package. Four Fortran IV programs 
which are compiled on the host computer during installation 
aid user program development. 


These are: 


Ll. APAL | A.P. Assembly Language. A cross-assembler 
which provides a two pass assembly of 
symbolic coding into an coject module. APAL 


generates detailed error diagnostics. 


2. APLINE A.P, Linker. Links and relocates separate 
. APAL object modules together into a single 
execution module. 


3. APDBUG A.P, Debugger. An interactive debugging 
. program. The. user may selectively set 
breakpoints, examine and change memory 
and register contents, and run program 
segments. | 


4, APSIM | ALP. Simulator. Called by APDBUG, APSIM 
provides a programmed simulation of the 


APSIM (con't). various hardware elements of the AP-120B. 
All timing ch»racteristics of the AP-120B — 
are emulated, ond the floating point © 
arithmetic is simulated (including rounding) 
to the least significant bit. APSIM is a 
convenient tool in bringing up new AP-120B 
programs off line without interferring with 
production runs. 


1.5.4. APTEST. (A. P. Test programs ). APTEST is a collection of 
interactive diagnostic test and verify programs which aid in 
isolation of hardware faults. These are: 


-1. APTEST . A. P. Tester. Exercises the Panel, DMA 
interface, and various internal registers 
and memories. Tests Main Data Memory with 
simple patterns and then with random numbers. 
Board level Se a on indicators are 


provided. 
2.. APPATH  : A. P. Path Tester. Tests the various inter- 
| nal data paths and gives board level diag- 
| nostics. 
3. APARTH A. PB. Arithmetic Test. Tests the floating 


point adder, multiplier, and S-Pad arith- 
metic unit with pseudo-random aS and 
operation sequences. 


4, FIFFT Forward/Inverse FFT Test. Verifies the 
| 7 correct operation of the AP-120B as a com- 
plete unit by doing forward/inverse FFT 
transforms on both spikes and random num- 
“ber sequences. 


TABLE (oi 


FLOATING POINT ARITHMETIC TIMES 


Travel a Pipeline 

Time.» Interval 
Add/Subtract 0.333 us 0.167 us 
“Multiply 0.500 us | 0.167 us 
Multiply-Add 0.833 us 0.167 us 
‘Complex Add/Subtract 0.500 us 0;333: us 
Complex Multiply 7 1.333 us 0.667 us 
Complex Multiply-Add 1.667 us 0.667 us 


Travel time is the total time required to aoe from the data 
source to the destination, including the full transport through 
‘the arithmetic units. Pipeline Interval is the time between 
successively available resultants. The former is important 
when. the successive arguments of a computation are dependent 
upon previous calculations. The latter is indicative of the 
maximum throughput rate available for successively independent 
calculators. 


BASIC SCALAR {UNCTIONS 


Divide 

Square Root 

- Exponential 

Natural Logarithm 
‘Base 10 Logarithm 
Sine 

Cosine 

Arctangent . 
Arctangent of (Y/X) 


TABLE Ic 


Timing 


WoO PhD AD ADD OW 


p+ 


83 
<oo 
sae 
.00 
.67 
42 
.79 
“6¢ 
83 


us’ 


us 
us 


us 


us 
us 
us 
us 
us 


Program Size 


(AP-120B Program Words) 


28 
28 
27 
37 
37 
31 
eee 
46 
46 


These functions take arguments from Data Pad and return full- 
word accuracy results to Data Pad. 


‘coefficients for these functions 


512 words of Table Memory. 


Full-precision polynomial 


are contained on the standard 


TABLE 1. 5 


SUMMARY OF AP-120B FORTRAN CALLABLE ROUTINES 


" gal Vector Operations 


Operation 


Vector Clear 

Vector Move > 

Vector Negate 

Vector Add’. 

Vector Subtract 

Vector Multiply 

Yector Divide 
Vector-Scalar Add 
Vector-Scalar Multiply 
Vector-Signed Square 
Vector Absolute Value | 
Vector Square Root | 
Vector Logarithm (Base 10) 
Vector Natural Logarithm 
Vector Exponential | 
Vector Sine 

Vector Cosine 

Vector Arctangent 

Vector Arctangent of (Y/X) 
sum of Vector Elements 

sum of Vector Squares 

vot Product of two Vectors 
Vector Float 

Vector Scan and Scale (Fix) 


Vector Maximum 


Minimum Operation 


wMaximum Element in a Vector 
Minimum Element in a Vector 
Maximum Magnitude Element in a Vector 
Minimum Magnitude Element in a Vector 
Maximum and Minimum of a Vector 
Maximum and Minimum Magnitude of. 
a Vector 
Vector Maximum (of two vectors) 
Vector Minimum (of two vectors) 
Vector Maximum Magnitude of two 
vectors 
Minimum Magnitude of two. 
vectors 


Vector 


Name 


VCLR 
VMOV 
VNEG 


~ VADD 


VSUB 
VMUL 
VDIV 
VSADD 
VSMUL 
VSSQ 
VABS 


' VSORT 


VLOG 
VLN 

VEXP 
VSIN 
VCOS © 
VATN 
VATNZ 


SVE 


SVS 
DOTPR 


VFLT 


VSCSCL 


MAXV 
MINV | 
MAXMGV 
MINMGV 
MAXMIN 


MXMNMG 
VMAX 


_VMIN 


VMAXMG 


| VMINMG 


(us per point) 


Timing 
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NMector Filter Operations 


Vector Polynomial evaluate 
Difference Equations 


4 pole filter (difference equation) 


Complex Vector Operations 


Complex Vector Multiply 

Complex Vector Reciprocal 
Complex Vector Magnitude (Square) 
Rectangular to Polar Conversion 
‘Polar to Rectangular Conversion 


Matrix Operations 


Matrix Transpose 

Matrix Multiply 

Matrix Multiply (Dimension <32) 

Matrix Inverse 

Matrix Vector Multiply (3 X 3) 
Matrix Vector Multiply (4 X 4) 


Fast Fourier Transform Operations 


Complex FFT : 

Real FFT a 

Scrambled to True Order FFT Passes 
Bit-reverse Order an Array 

-Real Transform Unravel Pass 


Signal Processing Operations 


Convolution (or correlation) 
Wiener-Levinson Algorithm 

' Bandpass Filter 

Power Spectrum 

Complex Cepstrum 

Inverse Complex Cepstrum 
Schaffer's Phase Unwrapping 


*See examples below © — -12 


VPOLY 
DIFEQ 
RECUR4 


CVMUL 
CVRCIP 
CVMAGS 
POLAR 
RECT 


MTRANS 
MMUL 


-MMUL32 


MATINV 
MVML3 
MVML4 


CFFT 
RFFT 
STFFT 
BITREV 
REACTR 


_ CONV 


WIENER 
BNDPS 
PWRSPC 
ICEPST 
ICEPST 
SHPH 


b+ A 
ooNnunw 
IROOCO 


8 

x 

a 

zi | 
2.5/vector 
4.6/vector 


* 
* . 
2 


1.75/complex pt140 
* 7 


40 
27 
15 


26 
o1 
18 
118 
45 


17 
58 
Zt 
130 
30 
39 


187 


235 


139 
42 


102 

68 
287 
268 
289 
289 

ve 


EXAMPLE TIMINGS FOR STANDARD MEMORY (333ns in milliseconds) 


Fast Fourier Transform 


Real Data Complex Data 
Points Bit- Real 7 Bit- Complex 
. Reverse FFT Total Reverse FFT Total 
256 Oc2e 0.90 gee Bes 0.45 1.41 1.86 
512 0.45 1.76 2.22 0.90 3.48 4.38 
~ dey OBE: 0.90 4.18 5.08 £.e0 6.93 8273 
2,048 1.80 8.32 LO212 3.59 16.60 290.19 
4,096 3.09 «19237 ° “22,96 ee re be 33.16 40.33 
8,192 7.17 38.69 45.86 14.34 77.31 91.66. 
16,384 14.34 88.36 102.70 28.68 154.59 183.27 
32,768 28.68 176.68 205.35 57.35 353.29 410.64 
65,536 . 57.6 417.5 475.1 


Convolution or correlation 


Operator . Result 
Length Length Time 
32 100 0.70 
32 1,000 6.7 
100 1,000 1:9 50 
1,000 | 1,000 181.4 


Wiener-Levenson Algorithm 


Size Spike.Case General Case 
50 156 ald 

100 . . had oes 

200 26.6 60.4 


Matrix Operations 


Dimension , Transpose Add Multiply Inverse 
10 X 10 Or 20 ome | 0.63 2 

20 X 20 Ovat Od. 4.20 8 

30° X. 30 0.81 gia ie ee) Dt: 
90 X50. 2c2 Sie 68.3 230 
100 X 100 8.6 | 12 


.6 940.8 1,840 


(INTENTIONALLY BLANK) 


AP-120B | 
CONFIGURATION GUIDE 


Configuration: Rack Mount 


Common: floating or Connected to 
. Chassis 
Input Power: 105/125 V or 188/228 V 
or 210/250 V (20/11/10 A Service) 
50/60 Hz ‘: Or 50/400 Hz 
Data Memory: 38-bit words, MOS. 333 ns interleaved 
; . eycle time. 8K word increments: K 
56K maximum without expansion chassis. 1 


million words maximum with expansion chassis. 
Table Memory: _ 512 words of 38-bit bipolar ROM standard 
for transcendental coefficients. Order 
optional N-words for each 4N-point real 
or complex FFT in 512-word increments. 
words (65K maximum). 
Program Source Memory: 256 words of 64-bit bipolar RAM 
standard. Order in 256-word increments. 
words (4K maximum). 
Host Computer Type: _ Company & Model 
Host Operating System: | 
Purchaser: | | (name ) 


(company ) 


(address) 


(phone) 


me 


(P. O. Number) 


1-15 


(INTENTIONALLY BLANK) 


SECTION 2 
FUNCTIONAL DESCRIPTION © 


~The hardware of the AP-120B is eonpesed ae three 
types of functional elements. 


1. | Logical and control wienenes 
a. Control unit 
b. S-Pad unit _ . 
2. Floating-Point arithmetic elements 
| a. Floating-point adder 
b. Floating-point multiplier 
3 Memory elements 
. a. Data Pad unit 
b. Main data memory unit 
c. Table memory unit 


Each of these functional units is independent and thus 
can independently perform the programmed operations for which 
it was designed in parallel with the other functional units. 


2.1 CONTROL UNIT 

The Control Unit, as illustrated by Figure 2.1, consists of: 
Program Source Memory (PS). | 

Program Source Address (PSA) Register. 


Control Buffer (CB) with decoding logic. 
Subroutine Return Stack (SRS). 


200 2 


The operation of the AP-120B is controlled by the execu- 
tion of 64-bit instruction words which reside in Program 
Source (PS) Memory. The program word for the next instruc- 
tion. to be performed is selected by the address in the 
Program Source Address (PSA) register. At the initiation 
of the next machine cycle, this program word is transferred 
to the Control Buffer (CB) where it is decoded and executed. 
The PSA is incremented by one unless a branch in the current 
instruction causes the PSA to move to another location in 
Program Source (PS)memory. Access to Program Source memory 
and instruction decoding are overlapped so that the AP-120B 
can operate at a 6 MHz rate (167 ns). * 

Branching is accomplished in two manners. A short-range 
‘branch is provided by adding the 5-bit branch displacement 
field to the current PSA. This gives a branch range of from 
~20g to +173. A long-range jump to any location in PS is 
accomplished by loading the desired target address into PSA. 

Subroutine jumps are made by a "JSR" instruction which 
saves the current PSA in the Subroutine Return Stack (SRS) 
and sets PSA to the subroutine address. Return is via a 
"RETURN", which loads the PSA with the last entered return 
address on the SRS. 


268 


| Buffer(CB) 


SRa (Subroutine Return Address. $s the Subroutine Return Stack 
pointer, which is automatically incremented or decremented as 
subroutines are called and returns are made from the subroutine. 


Figure 2.1 Control Unit, 


Program 
Source. 
Memory . 
(PS) 


Program Source Address (PSA) 


Subroutine 
| Return 
SRA Stack 


Control 


S-PAD UNIT 


This unit, illustrated by Figure 2.2, performs the integer 


address indexing, loop counting and control functions necessary 
to direct completion of a. given algorithm. In form, it is similar 
to Tramiiiar mini-computers such as the PDP-1l or Nova. 


The S-Pad contains sixteen 16-bit directly-addressable 


registers. The contents of these registers pass through a special 
integer ALU associated with this unit. 


The output of the ALU may be directed back to the specified 


S-Pad destination register, and also to any of the following address 
memory registers: Memory Address ee Table Memory Address (TMA), or 
Data Pad Address (DPA). 


The 


S-PAD integer ALU functions include: 


Function | Effect | 
a. wove “§S*+D S-Source register 
b. Logical complement | S+D D-Destination register 
ec. Clear | | ~ §+D 
d. Increment a S+1+D 
e. Decrement . ; S-1+D > 
f. Add - D+S>D 

gg. Subtract D-S=+D 

nm. Logical AND) ~ D AND S+D 
ds dsOGLeal. OR D OR S+D 

ds Logical Equivalence D EQVS+D 


The output of the S-PAD ALU (called S-PAD FUNCTION or SPFN), may 


be used unmodified, shifted left once < SHLEted ricat ‘once, or shifted 
right twice. | 


the 
ia att 


A hardware bit-reverse function included in the S-Pad accomplishes 
bit swapping necessary to access data 1 Serenoece order after an 


The S-PAD ALU also sets three condition bits in the AP ~120B 
‘Status Register depending upon the output of the ALU/shifter : 


N: set if result <9; cleared otherwise 
Z: set if result =9; cleared otherwise 
C: Set if a carry occurred; cleared otherwise 


‘Rese bits may be tested by the next AP instruc- 


tion, and a branch made depending upon whether the “specified | 
condition was true. 


Figure 2.2 S-=Pad Unit 


i _ ‘oe weg ie ee 
igus Pad Address (DPA) Register | 
| a1 


Memory: Address (MA) Register | | 


Registers} 


Data Pad Bus (DPBS) 


2.3 FLOATING POINT ADDER UNIT 


The Floating Point Adder, shown in Figure 2.3, does addition 
(or subtraction) operations on the contents of the Adder input 
registers (Al and A2). The operation is performed in two stages, 
each of which takes one machine cycle. 

In the first stage, the exponents of the two numbers are com- 
pared and the fractions are aligned by shifting the fraction of 
the smaller number right. The fractions are then added (or sub- 
tracted). In the second stage the resulting fraction is normalized 
and convergently rounded. ~ | 

Since the two stages are independent of each other, a new pair 
of numbers may be entered into Al and A2 every AP cycle (167 ns). 
The result is available for use two cycles later (333 ns). 

In effect, the Floating Adder (FA) is a pipeline, where new. 
inputs may be entered into the pipeline stream every cycle. Ini- 
tiation of an add operation loads the two numbers to be added into 
the Al and A2 input registers. The previous Adder input is pushed 
down the pipeline to the Adder Buffer register. One cycle later 
the completed result (called FA) from the Buffer is available for 
storage or use by another unit. Thus a new add may be started every 
167 ns, and the result is ready 333 ns later. 

Al may be loaded from Data Pad (DP), from the output of the Float- 
ing Multiplier (FM), or from Table Memory (TM). A2 may be loaded 
from Data Pad (DP), from the output of the Floating Adder (FA), 
or from Data. Memory (MD). 

The output of the Floating Adder (FA) may be directed to the 
Multiplier (M2), to the Adder (A2), to Data Pad (DP), or to Memory 
Input (MI). ee 


‘The operations performed by the Floating Adder are: 


Alt+A2 

A1l-A2 

A2-Al 

Al EQV A2. 

Al AND A2 

Al OR A2 

Convert A2 from signed magnitude to 2's complement format. 
Convert A2 from 2's COMD eens to signed meee eon format. 
Scale A2 — | 

Absolute value of A2. 

Fix A2. 


woe 0g ho Ro ow 


Four condition bits in the AP Status Register are set or 
cleared by the Floating Adder depending upon the current result: 


ra ="S6U toO°One if result 1s zero, else cleared toi-zero. 
FN - set to one if result is negative, else cleared to zero. 
FO - set to one if exponent overflow occurred. The result 
was forced to the signed maximum value. 
FU - set to one if exponent underflow occurred... The result 
was forced to zero. 


The overflow and unerflow bits remain set until cleared by 
the pEcenam: 


| These bits may be tested by the instruction after the 
Floating Adder result is completed; i.e., three cycles after - 
the Floating Adder operation was initiated. | 
Figure 2.3 Floating-Point Adder Unit 
Al ZERO DPX DPY TM FM FA DPX DPY. MD ZERO A2 


Align 


fractions 

and add 

Buffer 

Normalize 7 
and Stage 2. 


round 


(2.4 FLOATING POINT MULTIPLIER UNIT 


The Floating Multiplier, Figure 2.4, forms the product of 
the two multiplier input registers (M1 and M2). The product 
is formed in three stages, each of which takes one machine 

cycle. ; 


In the first stage, the 56-bit product of the two 28-bit 
fractions are partially completed. The second stage completes 
the product of the fractions. In the third and final stage the 
exponents are added, and the mantissa product is normalized and 
convergently rounded. . 

The Floating Multiplier, like the Floating Adder, is or- 
ganized as a pipeline. Initiation of a multiply loads the two 
numbers to be multiplied into the M1 and M2 input registers. 
The two previous multiplier inputs are pushed down the pipeline 
to Buffer 2 and Buffer 3 respectively. One cycle later, the 
result from Butter 3 is available for storage or use by another 
unit. 

Thus a new seoaaee may be started every 167 ns, and: the 
result is ready 500 ns later. | 

Ml may be loaded from Data Pad (DP), the output of the 
Floating Multiplier (FM) or from Table Memory (TM). M2 is. 
loaded from Data Pad (DP), the Adder (Al), the Multiplier 
(M1), or to main Data Memory (MD). 


Two error ies in the AP Status Register are affected by the Float- 
ing Multiplier: . 


FO - set if exponent overflow eee The result was foreed 
to the signed maximum value. . 


FU - set if exponent underflow occurred. The result was forced 
to zero. 


Figure 2.4 Floating Multiplier 
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2.5 DATA PAD. UNIT 


Data Pad, illustrated by Figure 2.5, consists of two | 
fast accumulator blocks, each with 32 floating-point locations, 
called Data Pad X (DPX) and Data Pad Y (DPY). In a single 
machine cycle the contents of one location from each Data Pad 
may be read out and used. In addition, data may also be. 
stored into one location in each Data Pad in the same cycle. 

That is, for example, in a single instruction (167 ns) a multiply 
may be initiated specifying one argument from DPX and another 

from DPY; an Adder result (FA) may be stored into a DPX location, 
and a data element in Main Data stored into a DPY location. On 

the very next instruction similar multiple Data Pad accessing could 
be accomplished again. 


The two memories are addressed via a combination of the 
Data Pad Address (DPA) register and four index field values 
contained in4 given instruction word. DPA may be thought of 
as a base address register or stack pointer. It may be loaded 
from the S-Pad (SPFN) or its contents may be incremented or 
decremented by one. 


For a given read or write operation, say reading from Data 
Pad X, an index value contained in the instruction is added to the 
current contents of DPA to give the effective address for that 
particular operation. The four index fields (one each for read 
DPX, read DPY, write DPX, and write DPY) are each 3 bits wide, 
and have a range from -4 to +3 relative to DPA. 


Data from either Data Pad may be used by the Multiplier 
(M1, M2), Adder (Al, A2), or Memory Input (MI). Data may be stored 
into Data Pad from the Adder (FA), Multiplier (FM), S-Pad Function 
output (SPFN), the Command Buffer Value (VALUE),. or from Data 
Pad (DP). a, * * ; 


Figure 2.5 Data Pad 


INBS VALUE DPX DPY MD SPFN TM 
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2.6 DATA MEMORY UNIT 


The Data Memory unit, illustrated in Figure 2.6, is 
the primary data store for the AP-120B. It is available 
in 38-bit wide 8K modules which have an interleaved cycle 
time of 333 ns. 

The memory unit contains a Memory Data (MD) buffer and 
a Memory Input (MI) buffer. Data read from memory is placed 
by the controller into MD, while data is written into memory 
from the MI. The Memory Address (MA) register points to the 
desired memory location. 

: In referencing memory for read or write operations, the 
selected operation is initiated by making a change to the 
Memory Address (MA) register. The MA register may be loaded 
from the S-Pad (SPFN) or its contents incremented or decre- 
mented by one. 

A write operation is specified ag loading MI with the 
data to be written during the same instruction in which MA 
is changed. This data is then written into memory from MI 
during the next two AP cycles. Data may be loaded into MI 
from the Floating Adder (FA), Floating Multiplier (FM), Data 
Pad (DP), Memory (MD), Table Memory (TM), the Input Bus (INBS), 
~S-Pad Function (SPFN), or the Command Buffer Value (VALUE). 
A memory operation may be initiated every other cycle. The 
-intervening cycle may be used for any other AP-120B function 
except another memory initiate. 

. When a memory read is initiated, the requested memory 
data is placed by the memory controller into the Memory Data 
(MD) register 3 cycles after the request was made. Two in- 
structions after the read request, another memory operation 
may be initiated. Again, the intervening cycle may be used 
for any non-memory functions. Data in MD may be used by the 
Floating Adder (A2), Floating Multiplier (M2), or Data Pad 
(DP). 

To optimize the operation of the AP-120B it is necessary 
for the programmer to "look ahead" and initiate memory reads 
prior to the actual time that arguments from data memory are 
to be used in a calculation. 

The system provides a ''memory lock-out" which serves to 
insure that erroneous reads and writes of memory do not occur. 
If a memory initiate occurs while memory is "busy," further 
program execution is halted until the pEevsous memory cycle 
is completed. 


Figure 2.6 Data Memory Unit 
FA FM INBS VALUE DPX DPY MD SPFN TM 
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2.7 TABLE MEMORY UNIT 


The repeated use of standard constants (such as complex 
roots of unity and transcendental values) in signal processing 
-routines dictates their ready availability to the programmer. 
A separate Table Memory (TM), shown in Figure 2.7, eliminates 
memory accessing conflicts by allowing data values and table 
values (constants) to be placed in separate memory banks. 

Values read from Table Memory are placed by the controller 
into the Table Memory (TM) buffer register. The Table Memory 
Address (TMA) register serves as a pointer to the desired lo- 
Cations : , 

A Table Memory read is initiated by changing the contents 
of TMA, either by loading a value from the S-PAD (SPFN), 
or by incrementing or decrementing the contents of TMA. 

A new table value may be requested every machine cycle. 
This value is available for use two cycles later. The value 
may be used by the Floating Adder (Al), Floating Multiplier 
(ML), or Data Pad (DP). . 

' In FFT mode (i.e., when a FFT is being computed), the 
address.in TMA is interpreted by the hardware to be an angle. 
which points to the appropriate root of unity for a particular 
step in the algorithm. This allows the full table of roots 
of unity to be compressed into a single quadrant of cosines. 


Figure 2.7 Table Memory 


TMA 


y Vv 
Al M1 DPBS 


2.8 INTERNAL FLOATING POINT FORMAT 


‘Floating-point data internal to the AP-~120B is represented as 
follows: 


H) : 9 10 : | 5° ao 
EQ  . EQ M@ = M27 
Where: | | 
Mantissa 28-bit two's complement fraction 
Exponent 10-bit binary exponent, biased by 512 


The value of a EROSLINES point number in this format is defined 
as: . 


Mantissa * 2 (Exponent 512) 


The ey range of this format is from 0.5 * 27512 to 
(1-27-78 )#2°4)> or, from’ 3:7*10-)°2* to, 6. 7*10"?*.. 

The 28-bit fraction, combined with the convergent rounding 
algorithm used in the Floating Adder and Multiplier, gives a 
maximum relative error of 7.5*107? per arithmetic operation. 

This is a precision of 8.1 decimal digits. As a comparison, 
unrounded IBM 360 format gives only 6.0 decimal. digits of arith- 
metic accuracy. 

The convergent rounding hardware rounds up when the mag— 
nitude of the remainder is greater than 4 of the least signi- 
ficant bit of the mantissa. This serves to minimize truncation 
errors in long series of arithmetic calculations. 

Format conversion between Host format and AP-120B format 
occurs in the Interface and in the Floating Adder unit. The 
dynamic range of the internal format is large enough to accom- 
modate IBM 360 format and other Host formats. The extended 
precision of the AP-120B internal format insures that accuracy 
is maintained during critical stages of data analysis. 


N 
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| SECTION ? 
PROGRAMMING CONS... ERATIONS 


This section provides an introduction to programming 
the AP-120B. The principal operations which control each 
of the six functional units are described below. A complete 
listing of the AP-120B instruction word fields may be found 
in Appendix B. 

In the coding examples semicolon (;) is used to separate 


‘operations within a complete instruction word. A comma 
(,) separates operands. A quote mark Cy ts used to denote 
-a@.comment. A less than ("<'") is. used to mean "+" (replaced bv) 


where the operation involved is a data transfer. 


3.1 FLOATING POINT ADDER. 


3.1.1 Floating Adder Operations. Floating Adder cocrarions 
are initiated by the following instructions: 


Cinsemneio ws) (Operands) hosekeions Initiated) 
FADD | x 4 Al1l,A2 . Al+A2 
_FSUB | Al, A2 A1l-A2 
FSUBR Al, A2 A2-Al 
FAND : : -Al,A2 : Al AND A2 
FOR. | | Al, A2 : Al OR A2 
FEQV | Al,A2 Al EQV A2 
FABS A2 . - ABS(A2) 
FIX | A2 Convert A2, floating to 
| fixed 
FSM2C A2 - Convert A2, Signed Mag- 
nitude to 2's complement. 
F2CSM . A2 Convert A2, 2's complement | 
ad . £0 signed PeSn TEC 
FSCALE A2 Scale A2 : 


where Al and A2 are any of the following data sources: 


Al: FM Floating Mulitplier result. 
DPX Data Pad X accumulator 
DPY . Data Pad Y accumulator 
TM Last data read from Table Pane 
ZERO Floating- =Potny zero 
A2: FA Floating hades result 7 
DPX 
DPY . . 
MD ~ Last data read from Data Memory 


ZERO 


Any data source listed under Al may be combined with 
any data source listed under \2. For example, to add a 
number from Data Pad X to an:‘her from Data Pad Y: 

FADD DPX, DPY "DPX + DPY 
or ve subtract a number ead out of Data Memory from a con- 
stant in Table Menery: 

-FSUB TM, MD | "TM = MD 
A reverse subtract changes the order of the subtraction, i.e. 


FSUBR TM, MD MD — TM 


subtracts a constant from Table wemory from a number in Data 
Memory. 


To negate a. number from Dees 
FSUB ZERO, DPX | "Q.0 -— DPX = -DPX 

To take Se ee of a number from Data Memory: 
FABS MD | | "ABS (MD) 


To fix (convert from floating- -point to integer) a number from 
ieee | 


FIX DPY | "PIX (DPY) 


- 3.1.2 Adder Pipeline. The Floating Adder is a two-stage 


pipeline. A "FADD" instruction loads the designated operands 
into the Al and A2 registers. The previous contents of Al 

and A2 are pushed down the pipeline to the Buffer register. 
One AP cycle later the new contents of Buffer have been nor- 
malized and rounded, and are then available for use or storage 
elsewhere. | 

: The following instruction sequence illustrates how the 

_ Adder pipeline works, where A,B...G,H are floating-point . 
numbers to be added: , | 


~ a Adder Pipeline; | 
| . | Adder 
Time Cycle Instruction Al, A2 Buffer} Result (FA) 


pls, 
t) res FADD A,B A,B Soeee ene 
167ns 5. FADD C,D eens A,B* ee 
333ns 3. FADD E,F E,F =D. A+B 
500ns 4, FADD G,H_ G,H (ea C+D 
667ns 5. FADD — G,H E+F 
6. ane — G,H G+H 


833ns ae 


The 'FADD" without arguments in cycle 5 is used only to 
push the last computation into ‘‘e Buffer Register, and hence 
to the end of the pipeline. Thus, it is a dummy add in the 
sense that we don't care what its arguments are, since we will 
never use the results. In the above example we completed 
our floating-point adds in one microsecond. During cycles 
2-4, while we kept the pipeline full, adds were being done 
every 167ns, the maximum rate. a 

The completed results, as they come out of the Adder 
pipeline, are referred to by the mnemonic "FA". FA is | 
dynamic, in the sense that it must be used or stored else- 
where before being changed by the next floating-adder 
instruction. The programmer has, however, complete control 
over the pipeline. Arguments advance only when pushed through 
the pipeline by floating-adder instructions. 


3.1.3 An Example. A complete computational sequence is: to 

do the vector sum A,=A,+B;, 170,1,2,3. Aj; is stored in Data 
Pad X locations 0-3 and B; is stored in Data Pad Y location 

0-3. | 


1. FADD DPX(9), DPY(9) "De Ag+Bo 


2. FADD DPX(1), DPY(1) © , "Do A+B, 
3. FADD DPX(2), DPY(2); DPX(9)<FA "Do A2+B2, Ao+Bo is now 
| | | done, save it in Ao 
4. FADD DPX(3), DPY(3); DPX(1)<FA "Do A3+B3, A1+B1 is now 
. | done, save it in A} 
5. FADD DPX(2)<FA "Pugh Adder; save AotBoin Az 
6. DPX(3)<FA | | "Save A3+B3 in A3 


Below is a chart of this computation, showing the state of 
the Adder pipeline and Data Pad after each instruction is 
executed. | 


Cycle Re n5 Beier feeui a a = So :« 8B 
Ea a Ao, Bo --- --- Ao Al A2 A3 
2. " Wiis. Bi Shee tee a Ao Aq A2 A3 
3. A2, Ba Al, Bi pera ~  AotBo Al | A2 A3 
4.  . A3, B3 Az, Bo Ate, Ag#By A1+B, A> Ag 
5, feat he Bs eh. vaeGb, AGsey eee a 


6. --- Ag, B; Aj+B3; = Ag*Bg A+B, Az+By A3+Bs 


3.1.4 Floating Adder Tests. The fo! Lowing conditional branches 
test the Floating Adder result (FA). 


BR LOOP "Branch ieenatetoneiie to program 
"location ''LOOP" 
BFEQ LOOP "Branch if FA=0.0 
BFNE LOOP "Branch if FA#0.0 
BFGE LOOP "Branch if FA>0.0 
 BFGT LOOP | "Branch if FA>0.0 


The above branches test "FA" one instruction cycle after 
‘+ is ready for use. That is, an Adder result may be tested 
one cycle. after it: nas come out of the Adder pipeline. An example: 


nie -FSUB DPX, DPY "Do a computation 

2. FADD "Push the result out 

3. DPX>FA | "Save the result. 

4. BFEQ LOOP "Test the result here (branch to 

ues . " location "LOOP" if result was 
'" zero) 


Compound fests may be made also. Test MD to see if it is between 
a lower limit. contained in DPX (1) and an upper limit in DPX (2), 
i.e., see if. DPX(1)<MD<DPX(2): 


1. FSUBR DPX(2), MD "Do MD-DPX(2) 

2. FSUB DPX(1), MD "Do DPY(1)-MD . 

3. FADD . "Push first test result out 
4. BFGT. BIG . "Was too big 

5. BFGT SMALL "Was too small 

6. er aan oe "OK 


The branches are made relative to the current Program 
Source Address (PSA), with a 5-bit displacement value. This 
means that the conditional branch target address must be with- 
in ~208 tO +179 locations of the current instruction. 


3.1.5 Floating Point Logical Operations. These instructions 
(FAND, FOR, FEQV) perform logical operations on floating-point 
numbers. Exponent alignment occurs as for a normal floating- 
point add. The two mantissas are then combined using the. 
specified logical operation. The result is then normalized and 
rounded. 


3.2 FLOATING POINT MULTIPLIER - 


3.2.1 Multiply Instruction. Floating-point multiplies are 
initiated by the following instruction: 


FMUL M1, M2 


which initiates a multiply between M1 and M2, where M1 
and M2 are any of the following data sources: . 


M1 FM Floating Multiplier result 
7 DPX Data Pad X accumulator 
' DPY. Data Pad Y accumulator 

TM Last data read from Table Memory 
M2 FA Floating Adder result | 

DPX 

DPY 

MD Last data read from Data Memory 


Thus, any of the data sources listed under Ml may be multi- 
plied by any of the data sources in M2. For example, to mul- 
 tiply a number read from Data Memory by a constant from Table 
Memory: . 


FMUL TM, MD "TM * MD 


or, to multiply a number in Data Pad X by another number in 
Data Pad Y: 


FMUL DPX,DPY _ eae * DPY 


ee we dateipiter pipette, The Floating Multiplier is a 
three-stage pipeline. An "FMUL" instruction loads the . 
specified operands into the M1 and M2 registers. The two 
previous partially completed products are pushed down the 
pipeline to Buffer 2 and Buffer 3 respectively. One AP 
@ycle later the new contents of Buffer 3 have been normalized 
and rounded, and are then available for use or storage else- 
where. 

The following instruction sequence illustrates how the 
Multiplier pipeline works, where A,B...G,H are floating- . 
point numbers to be multiplier together. 


Multiplier Pipeline 


. | | Pi ophmieine., Gass e Multiplier 
Time Cycle Instruction M1,M2 Buffer 2. Buffer 3] Result (FM) 


Q oo FMUL A,B AB. Ae = — 
167ns— 2. FMUL C,D C,D A,B aid a 
333ns. 3. FMUL E,F Ek’ ,. 6.0 A,B —— 
500ns 4, FMUL G,H G,H EF ep A*B 
667ns. 5. FMUL ee. IG ' Er. C*D 
833ns 6. FMUL ieee eee Gu. + E*F 
1.0us 7. wee i a G,H G*H 


The "FMUL'' in cycles 5 and 6 7-2 dummy multiplies used 
to push the last two computations to the end of the pipeline. 
In the above example we completed four floating-point multi-- 
plies in 1.0us., During cycles 3-4, while the pipeline was 
full, products were being done every 167ns, the maximum rate. 

The completed products as they come out of the Multiplier 
“pipeline are referred to by the mnemonic "FM." FM is dynamic, 
in that it must be used or stored before being changed by 


the next "FMUL" instruction. 


3.2.3 An example. A computational example is to square the 
elements in a vector: 7 


A; = Ai*Ai, 1=0,1,2,3. Aj is stored in Data Pad X, 


1. FMUL DPX(9) ,DPX(@) . | "Do a 

2.° FMUL DPX(1) ,DPX(1) . | | “Do At 

3.  FMUL DPX(2),DPX(2) - "Do Ay 

4, FMUL DPX(3),DPX(3); DPX(9)<FM "Do a2, save a? 
5. FMUL; DPX(1)<Fi s "Save A? 

6. FMUL;. DPX(2)<FM | a "Save A’ 

7. DPX(3)<FM = | | | "Save A’ 


Below is a chart of this computation, showing the state 
of the Multiplier pipeline and Data Pad X after each instruc- 
tion is executed. 7 | 


Multiplier Pipeline _ | 7 ; 

| Multiplier Data Pad X © 

Cycle |M1,M2 Buffer 2 Buffer 3 Result (FM) . 0 a 2 3 
i . 


Ag,Ag --- ee eae Ry AG Rh he 
Pinve Myyy Mg ede? « “eee | Ao Ar Ag Ag 
See 8 -Ape ss Ag At Aa Ag 
4. A3,A3 Az, Ae a At a2 A, Az Ag 
Se See Mog . Bec we AR a2 An AG 
Gy, se eee he a az” a2 az Ag 
7. =-- --- A3,43 A2 A2 At a2 a2 


a 
o— 


3.2.4. Multiply-Adds. 


two eight-element vectors Aj° i. = TA.B. 
where Aj is in Data Pad X and B 


Fill the [1. ¥FMUL 
ree, 2, FMUL 
Pipeline 13. FMUL 
4, FMUL 
~ FADD 

Fill the 
Adder . - . FMUL 
Pipeline FADD 
.  FMUL 
FADD 
Both - 
Pipelines< 7, FMUL 
£12 FADD 
| 8. FMUL 
FADD 
9, FMUL; 
Empty the : 
Multipliers 10. FMUL; 

Pipeline > 
11. FADD 
12, FADD; 

Empty 


The full “pea Rina -point computational 
power of the AP-120B is utilized when we consider a process’ 
involving both mutiplies and adds, 


DPX(-4) ,DPY(-4) 
DPX(-3) ,DPY(-3) 
DPX(-2) ,DPY(-2) 
DPX(-1) ,DPY(-1) ; 
FM, ZERO 


DPX(#)) ,DPY(G) ; 
FM, ZERO 


DPX(1) ,DPY(1); 
FM, FA. 


DPX(2) ,DPY(2); 


FM, FA 


DPX(3) ,DPY(3) ; 
FM, FA 

FADD FM, FA 
FADD FM,FA 


FM,FA 


DPX(3)< FA 


the Adder 413. FADD DPX(3),FA 


Pipeline i 


L 


“tt 


Form the dot product of 
iB, i= 4, -3,... 1, 2, 3 
ay Gis in Data Pad Y: . 


"Do A219, 
"Do A_3B_3 
UDO A_»Bi»5 


"Do An; Bay. A_yBoy is 


'" now done, save it in 

" adder, 

"Do AgBg. A~3B_3 is now 

' done, save it in the 

" adder. ; 

"Do A,B). A_ Bo is now 
coming out of the mul-! 
tiplier, and A,B, 


tt: 


' from the adder, add 


them together. 


"Do A2B2; A_,B_1 is now 


tt 


coming out of the mul- 
tiplier, and A_3BL 

" from the adder, add 

'" them together. 

"Do A3B3.° AgBg is now 

' coming out of the mul-. 
'"  tiplier, and (A_yB_y +. 


"  A-2B_2) from the adder, 


"add them together. 

"A,B, is coming out of the 

" multiplier, and (A_3B_3 

ame oo Bs id from the 

'' adder, add them to- 

2 gether. 

"A2B2 is coming out of the 

“" multiplier, and (A_,B yy: 

+A.2B_2+A)By) from the . 
adder, add them to- 
gether. 

"A3B3 is coming out of 

the multiplier, and 

" (A_3B.3+A_, BL 1*A1B1) 

from the adder, add 

them together. 

"(Ay Boy +A_2B_2+A9Bg+45Bz ) 
*is coming out of the 
‘adder, save it in DPX(¢3). 

"(AW 3B j+A_;B_, +A, B,+A3B3). 

is coming out of the 
adder, add it to 
CApBe ytA.2Bi2+Ag Bot 
A2Bo) which was Saved 
in DPX(3). 


. FADD "Push result out of Adder 
DPX(3}<FA , "The result: (A.,B_yt+ 
ss wh A_ 3 BL qtA clo Bus tAny Buy + 
" AgBg +A,B,+4A2B2+43B3), 
" saved as DPX(3). 


one 


In accumulating the sum-of-products, the even term sum 
was kept in one half of the adder pipeline and the odd term 
sum in the other half. During cycles 5-7 when both pipelines 
were full, floating-point multiply-adds were being computed 
every 167ns. This is 12 million floating-point computations 
oer second. A longer sum of products calculation, involving 
more terms, would maintain this maximum computation rate for 
nearly all of the computation loop. Here, in a short calcu- 
lation, most of the time was spent filling and emptying pipe- 
lines, Even so, the seven adds and eight multiplies took 15 

eycles (2,5us) to complete, or an overall rate of 333ns per 
floating-point multiply-add. | 

As a further aid in understanding the multiply-add inter- 
action in the above sum-of-products computation, the chart 
below summarizes the computation: . 


Adder: a Data Pad: 


“Multiplier: 

Cycle [M1,¥ y [AL 22° PAe S 

ie Aci Bog. aes ye oe Spc 

2 Au3,Bi3 -=+ a ne fee 

3 A.2,Bi2 --- am eae ae 

a. > Me Bog Bog toy Ae, Bay,.o8 0 en ae 

5 AgyBo «= Ag *B3 AW3B_3,9.9 a ae 

6 Ay ,By Ap *B_p AnGB pj N ai Ba aes ae 

7 A>, Bo .ALyPALy AD Bey pA Bay: . ALgBey «=~ S45 

8. A3,Ba _ Ag *Ag AgBo, ES, ES, ae 

D -=-  A,*A,  A1B,, OS, OS, . Pere 

1.0), --- Ap *Ay AoB,, ES; ES; ee 
“Ee ~--  . Aj¥Ay.-43B3, 08; OS; - --- 

Wy a eee Pe BS, Ey 

13; = ==- “* OSy, ES, 08, ES, 

14. --- --- --- : ae ES, 

1S) Ss, eee aie = gt AREY OS, +ES, 
Bate " n terms of the even term.Sum: Aj iBy i = -4,-2,0,2 
OS is n terms of the odd term Sum: a; Asa = -3 ,71,1,3 


3.3 DATA PAD 


3.3.1 Data Pad Addressing. Data Pad is a block of 64 beans 
speed accumulators used to store intermediate results during 

a computation. In any given A?P-120B instruction the pLo- 
grammer has 16 of the Data Pad accumulators to work with, 8 

in Data Pad X (DPX) and 8 in Data Pad Y (DPY). They are | 
addressed relative to the current value of the Data Pad Address 
(DPA) register, which functions as a base register for Data 
Pad. For example, if DPA has a value of 24g, locations 20, 
through 2% would be available for use. - 


; DPX 


| 

| 

| ; 

; Available for use when DPAz2dg 

i 

{ : 

| 

i 


A displacement value from ~4 to +3 may be pebeetexee 
when using DPX and DPY, i.e., if DPA= =24d, : 


DPX(3) means DPX location 24+3=27 
DPY(-4) | means DPY location 24-4=20 
DPX(@) — means DPX location 24+0=24 
DPY means DPY location 24+0=24 


Four separate displacements are provided, one each for 
reading and writing DPX and DPY. Thus four separate locations 
in Data Pad may be used in a given instruction. With DPA=24g,, 
the following instruction occurs in one cycle: . 


FADD DPX(3),MD: FMUL TM, DPY(-2); DPX(-3)<Fa; DPY(1)<FM 
(read DPX) (read DPY) | (write DPX) _ (write DPY) 


This would 1) add DPX location to the last data read from Data 
Memory, 2) multiply the last data read from Table Memory by the 
contents of DPY location 22, 3) store the results of a previous 
add into DPX location 21, and 4) store the Roser rs of a previous 
multiply into DPY location 25. 1 


All 64 locations of Data ped: are accessed by Snneine 
the DPA pointer: 


INCDPA oe "Increments DPA by 1 


DECDPA . "Decrements DPA by Il 
SETDPA 2 "Loads DPA wit! the current S-PAD 


"function (SPFN, see ‘Section 3.6) 


‘Changes in DPA take effect for the next instruction after 
they occur; i.e., if we start with DPA=24: . 


1. FADD DPX(@), DFY(@); INCDPA "DPA. is still 24, so we add 
| | DPX24 to DPY24 
2. FADD DPX(@), DPY(@); INCDPA . "Now DPA=25, so we add 
he : | DPX25 to. DPY25- 
3.. FADD DPX(@), DPY(@) : Now DPA=26, so we add 


DPX2g to DPY26 


Thus, by successively incrementing DPA we can use Data 
Pad as a queue, or by properly incrementing and decrementing 
DPA, we can use Data Pad as a stack. Data Pad Address is 
circular. That is, with successive increments of DPA the 
next location after 37g is 9; with successive decrements 
Of DPA the next location after 9 is pee 


3.3.2 Writing into Data Pad. Data may be stored into DPX 
and DPY-from FA, FM, or DB (Data Pad Bus). 


DPX<FA "Store adder result into DPX 


DPX<FM : "Store Multiplier result into DPX 
DPX<DB "Store Data Pad Bus into DPX 
and DPY<FA "Store into DPY 
-~ DPY<DB 
DPY<DB 


The following may be qerectss onto the Data Pad Bus (DB): 


DB=ZERO "Floating-point zero 
' DB=INBS “TAput ‘Bus | 
DB=VALUE "A 16-bit immediate value 
DB=DPX "DPX 
DB-DPY — 'DPY | 
DB=MD "Last data read from Data Memory 
DB=SPFN "S-Pad Function (16-bit integer) 
DB=TM  ~° "Last data read from Table Memory 


PRUS: 2 DPA=248, we could have an instruction 
DPX(3)<FA; DPY(-2)<DB; DB=MD 
which would store the current Adder result into DPX location 


27 and store the last data read from Main Data memory into 
DPY. location 22 via the Data Pad Bus. 


3.3.3 Data Pad Bus. Data t: be stored into DPX and DPY . 
can be moved through three pathways: FM, FA, and DB. 
While FM and FA are fixed in meaning (output from the 
Floating: Multiplier and Adder respectively), the Data Pad 
Bus (DB) pathway can be connected to any one of eight 
possibilities, depending upon the programmers. CnOLCSs 
Some example situations: 


1. We-want to put MD into both DPX and DPY: 
DPX<DB; DPY<DB; DB=MD 


“We put MD onto the Dace. Pad Bus, and 
store the Data Pad Bus into DPX and DPY,. 


2. We want to put MD into DPX and TM into DPY: 
DPX<DB; DB= MD; DPY<DB; DB=TM © 
This is an sever’. Only one choice at a time 
can be made for the Data Pad Bus. Alas, not all 
things are possible even with 64-bits of program 
word. This double transfer would take two separate 
instructions to accomplish. 
“3. We want to store FA into DPX and MD into DPY: 
_DPX<FA; DPY<DB; DB=MD 


We put MD onto the Data Pad Bus in order to get 
it into DPY. FA goes directly into DPX. 


To simplify notation, data transfers involving Data Pad 
Bus can be written in a shorthand manner, as illustrated 
by the three above examples: 


Shorthand Longhand 
Ll. DPX<MD; DPY<MD DPX<DB; DPY< DB; DB=MD 
2. DPX<MD; DPY<TM. DPX<DB; DB=MD; DPY<DB; DB=TM 
a (Still an error no matter how we write iti?) 
3. DPX<FA; DPY<MD DPX< FA; DPY<DB; DB=MD 


In the shorthand notation, choices for the Data Pad Bus 

are not explicitly indicated. We write the transfers » 

as if there was a direct. connection between .the source 

and destination, while in fact it is the Data Pad Bus 
which does the connecting. The programmer must remember, 
however, that he is still making a Data Pad Bus choice, and 
that: only one choice is allowed per instruction. Errors 
like that in #2 (where we tried to make two Data Pad Bus 
choices) are detected and flagged by the assembler.. 


3.4 DATA MEMORY 


3.4.1 Memory Addressing. Main Data Memory cycles are 
initiated by changing the Memory Address (MA) register, 
which points to the memory location to be read from or 


“written into: 


INCMA | "Increment MA by 1 


-DECMA "Decrement. MA by 1 


SETMA “function (SPFN, see Section 3.6) 


All of the above initiate a memory cycle at the add- 
ress pointed at by the new contents of MA. If an ''MI" 
(Memory Input) field is also included in the instruction, 
then the memory cycle is a write cycle, otherwise a read 
cycle is initiated. When sequential memory locations are 
accessed, a new memory cycle may be initiated by- every other 
AP instruction. 


3.4.2 Data Memory Reads. Data read from memory is avail- 
able for use three AP instructions cycles after the "read" 
was initiated. The following instruction sequence illus- 
trates how memory data is accessed: A, B, & C are. floating- 
point numbers in memory locations 101, 102, 103 respectively. 
We assume that MA was set to 100 before we started. 


AP a a Memory Memory Data 
Time Cycle Instruction Address (MA) Result (MD) 
Oo o> I ~ INCMA 101 --- 
167ns 2 --- 101 --- 
. 333ns 8 INCMA 102 | --- 
o00Ons 4. a 102 A 
667ns os _ INCMA », Oe A 
~833ns 6. === " pikOes 7% B 
1.0Ous 7 -—— 103 | B 
1.17us 8 --- 103 c 


Three AP cycles after a given memory location was "read" 
the data from that location is ready in the Memory Data (MD) 
register and available for use. MD may be used by the Adder 
or the Multiplier: so . 


FADD DPX(3),MD; FMUL DPY(-2),MD "Do MD+DPX and MD*DPY. 


or also put on the Data Pad Bus and stored in Data Pad or 
back into memory: | - 


DPX(2)<MD : ee 3 "Store’ MD into DPX. 


3.4,3 An Example,. Load a vector AG. i= 0 ,L,2 stored in 
Memory locations 101, 102, 103 into 1px locations Los). ad 
12, We will assume that MA was set to 100 and DPA was 
set to 10 before we started, 


Ll. INCMA | | : | "Fetch Ag from Memory 

2.0 --- 7 

3.  INCMA- = _ "Fetch A, from Memory 

4, DPX<MD; INCDPA "Store Ag into DPX location 10 
7 7 "and bump DPA pointer to ll. 

5, -INCMA; °- ts "Fetch A>. from Memory 

6, DPX<MD; INCDPA "Store A, into DPX location ll 

| “and bump DPA pointer to 12. 
7. nee a | ie : ve | 
8. DPX<MD “Store A2 into DPX ‘location 12. 


Below is a chart of the above transfer, showing the state 
of each component after each instruction, 


Memory Data Pad | | 
MA MD © IDPA DPX19 DPX); DPX, 9! 


Cycle > 

ig 101 +--+ meee a ee 

2, LOL -—+ 10 ae ae | aes 
3% 102 --—— 10 ees eens panera 
4, 102 Ag. 10 Ag aoe a sais 
5. 103° Ay ll =—s- Ag a es 
Bre. 103 Aj "TL Ag AY --— 
Bra 103 «A, 12. Ao Ai sees 
8. 103 Ad | 12 = Ag — Ay — Ao 


3. 4.4 Data iemees Writes, Data Memory write cycles are in- 
dicated by: 


MI< FA | 7 "Write the Adder result into memory 
MI< FM | ‘Write the Multiplier result into memory 
MI< DB "Write Data Pad Bus into memory 


These instructions load data into the Memory Input (MI) buffer 
register, from whence it is written into memory. 

Data may be written into ee ica ad meuory locations by 
every other AP instruction. 


3.4.5 An Example. Square the elements of a vector Ay ,i=0,1,2, 
in DPX locations 10, 11, 12 and store the results into Data 
Memory locations 101, 102, (103, We will assume that MA was 
set to 100 and DPA was set to 10 before we started... 


Le, MUL DPX, DPX: INCDPA _ 3 4 “Square Ag, bump DPA pointer 
, Oe NOY slog 
2. FMUL 7 | "Push down the multiplier 
ak | “ie “Dipeline.s . 
FMUL DPX,DPX; INCDPA "Square A,;, bump DPA pointer 
1 tO. Ae, : 
4,  FMUL; MI<FM: INCHA , "Write A* into memory location 


ee “TOn., 


5. FMUL DPX, DPX iy 7% "Square Ay | 
6. . FMUL; MI<FM-: INCMA "Write AG into memory loc. 102 
7. FMUL es "Dummy FUL to empty pipeline. 
8. 


MI<FM; INCMA . 7 "Write AS into memory loc. 103 


Below is a chart of this eOnnucatien 


Mulitplier Memory 
cycle DPA [MI,N2 FW] [MA MT] 
1. 16 4k. ae 
oe Ps eee oe eae ee 

3, Te oAea et. 
4. 10: ee | AG EOL. AG 
5. “He. * “Poykoe se, On. aa” 

care ; 

6. Bee, Se a OR ae 
7 12 a 102 At 

- ; 
a — A? 103 A 


3.4.6 Memory Interleave. Data Memory is divided into 16 
banks of 4K words each using MAOO-MAO2 and MA15 as a memory 
bank select. (These are the three highest-order bits and 
the least-significant bit of MA.) Memory references to 
different banks may be made every 2 AP cycles, while refer- 
ences to the same bank may be made every 3 AP cycles. For 
some possible memory addressing sequences we have 


Memory Address Memory Bank - Memory Reference 
“Sequence (Octal) ; - Sequence . Timing . 

LOL sbO2. 2034" Oe... 1, 0, 1, 0,... every 2 AP cycles 
L66, °165,. 164... 163. 0,1, 0, 1,... every 2 AP cycles 
‘100, 102, 104, 106,... 6 Pema © aaa © er every 3 AP cycles 
233, 10374, 234,. 10376, ig ae Os “aa every 2 AP cycles 


Thus references to successive sequential memory locations may 
be made every other AP cycle,-but references to successive- 
odd or successive-even locations must be 3 cycles apart. 


3.4. 


7 Memory Lockout. If meme; references are made too 


rapidly for memory to handle, the CPU suspends program exe- 
cution and "spins" until the memory is no longer busy. Thus 
if we coded: . 


oie 


2. 
ve 


INCMA. “ "We are trying to refer- 

"ence memory every cycle. 
INCMA | 7 , 
INCMA 


We will get the following execution: 


0 
167 
333 
900 
667 


The 


ns eRe INCMA 
ns $2. INCMA 


ns "SPIN" 
ns =e INCMA 
ns vOP iN” 


processor "Waits" an extra cycle after instructions 2 and 


3 because memory is still busy from the previous memory refer- 
ences. This arrangement is fine if there is no useful computing 
to do during the "spin" cycles. Otherwise, it is better. to 
space our the "INCMA's'" and to do something useful during the 
cycles between memory references. 


3.5. TABLE MEMORY 


3.5,1 Table Memory Addressing. Constants stored in Table 
Memory are read by setting the Table Memory Address (TMA ) 
register to the address of the desired Table Memory location. 
This is done by the instructions: 


INCTMA | re ao “Increment TMA by 1 
DECTMA "Decrement TMA by 1 
SETTMA "Set TMA to the current S—Pad 


function (SPFN, see Section 3.6). 


Each of the above initiates a fetch from the Table 
Memory location pointed at by the new contents of TMA. 
Two AP cycles later the contents of the desired location are 
‘available for use. A new location may be fetched every 
AP cycle, The following sequence illustrates how Table Mem 
ory is accessed. KY K1, and K2 are constants stored in 
‘Table Memory location 235, 236, 237. We assume that TMA 
was set to 234 before we start. 


AP Table Memory — Table Memory 


Time Cycle Instruction Address (TMA) Result (TM) 
6) ‘oe INCTMA . 235 ee 

167ns os INCTMA . .236 | --- 
— 333ns ce INCTMA 20. | K@ 

—~6§00ns 4, aie | 237 : K1 

667ns = --- (237 7 K2 


Two cycles after a given Table Memory location was fetched, 
the data is ready in the Table Memory (TM) data register, and 
is available for use. TM may be used by the adder or the multi- 

plier: _ 


FADD TM, DPX(2);FMUL TM,DPY(-3) "Do TM+DPX and TM*DPY 
or put on the Data Pad Bus and stored into Data Pad: | 


DPX(-1)<TM ~ | "Store TM into DPX. 


3.5.2 An°Example. Do the vector sum Ai = B+K , i=0,1,2, where 
A; is in DPX locations 10-12, B; is in DPY 10-12, and K; is a 
series of constants stored in T: ole Memory loca cton 235-237. Aj 
will be stored back into DPX. we will assume that DPA was set to 10 
and TMA was ‘set to 234 before we start. 


INCTMA - o- 'Petch K 


he . 
2.  INCTMA | "Fetch Ky 

3. INCTMA; FADD TM, CPY: ~INCDPA "Do Kg +°Bg, bump DPA to 11 
4. FADD TM, DPY; “INCDPA "Do Ki + Bi, bump DPA to 12 
5. FADD TM, DPX (0); DPX(- 2)<FA "Do K2 + B2, store A in DPXl0 
6. FADD: DPX(-1)<FA - "Store A, in DPX11 

aan 


-DPX(0)<FA | - "Store Az in DPX12 


The following charts the above computation: 


Table Memory Adder - Data Pad X | 
Cycle TMA TM . A1,A2 FA DPA 10 dob 12 
lL. 235 _ ee ee 
2 236 Seles ig eta, eee fo: 228: eee. cee 
fade 237 Ko Ko ,Bog --- 10 ~--- --- —--- 
4, 237 Ke KR Ba ee Sy ee ee tee 
Sud BOE. Re Ke3By: KoeBg 22° hy See “eee 
6 237 = K5 eon “KeeBy 22) “Ags + Ay oan 


3.5.3 A Complex Multiply. An example using both memories 
is a complex multiply from the FFT (Fast Fourier Transform) 
algorithm. The multiply is between a complex signal point ~ 
held in Data Memory and a complex exponential value (a root 
of unity, ei?) fetched from Table Memory. The computation 
tS 

e eis acs Gs 


= * 
I ie wot hae: 


where C is the data setae and W is the complex exponential 
"R'" and "I'! denote real and imaginary parts respectively. C is in 
Main Data Memory, and W is in Table Memory. 


even the Ds, INCMA . "Fetch Cp from Data Memory 
4 arguments 2 INCTMA se "Fetch Wp from Table Memory 
| , 3. INCMA; INCTMA - ."Fetch C, fetch W, 
4.  FMUL TM, MD va "Do Cp * Wy 
Do the 5. FMUL TM, MD; DECTMA "Do Cp * Wy fetch Wy 
multiplies 6.’ FMUL TM, MD | | "Do C, * Wy . 
7. FMUL TM, MD; DPX(@)<FM "Do, C,*Wp , Save CpW,, In DPX 
8.  FMUL;. DPX(1)<FM : "Save CpW, in DPX 
Do the 2 9. FMUL; FSUBR FM, DPX(D) "Do Xp + CaWy-CrW, 
‘adds. 10, FADD FM, DPX(1) "Do Xz = CgWy+ CyWp 


3-17 


11. DPX(@)<FA; FAD Xp, is ready, save in DPX. 
12.  DPX(1)<FA xX, is ready, Save in DPX 
The total elapsed time is 12 cycles or 2us. In practice, 
however, we can overlap all but cycles 4-7 with the preceding 
and following computations. The complex multiply then takes 
us Only 667ns, when mixed in with other computations. 


Below is a summary chart of the complex maoranees 
Memories Multiplier _ Data Pad 


Cycle {TM MD | M1, M2 de ie T° “aD af [Oo 1 | 
pola os 


om os me ——<— = 


D) bumetes, suite Aces ae whee ees peer 
3 eet, eee, “tee oe ae foe ee een 
4 Woe SOs “WicGe izes? Soe meee 
5 Wr CR OWE, CR --- === --- --- --- 
6 Wr oC, OWi,Cr === | == mm- ee 
7 We Cl Wi,Cr W*c, --+ --~ WRC, --- 
Q wow ‘abe lake < ; Wr*Ch mie ae WRCp Was 
9 fs, Me, | ceased Wr*C WrC,,W,C.--- WrCr. WrCr 
1 . -——— -——— -—— Wp*C, Weel, W rc RXR WrCp . WrCr 
dd, ---  ---  --- ---  =--- see XR WrCr 
12 eee eee eee --- = XR XI 


3.6  S-PAD 


The S-Pad is a 16-bit wide inte ’or unit used primarily 
to compute memory address pointers.and to test loop counters. 
It is similar in capability to a minicomputer and programs 
like the register-to-register instructions of a Nova or 
PDP-11 computer. There are 16 registers in the S-Pad unit. 


3.6.1 Single ee Instructions. Choose one from each column. 


| nee 


Destination 
Register 


dst; 


The Operation is performed upon the contents of the 
Destination Register (dst) and that result is Shifted. The 
shifted result is stored into the Destination REgister unless 
a No Load (#) is specified. The shifted result is the S-Pad 
Function (SPFN), which may be stored into an address register 
(MA,TMA, or DPA), or placed onto the Data Pad Bus (DB=SPFN). 
Some examples, where SP, refers to the contents of S-Pad 
register "n'': | . | 


‘INC 6 | "(SP +1)+SP, 

DECR 3 | 7 | ee "(SP3-1)/2+SP3 

COM 3; DPX<SPFN _ es | "5P5+SP,+DPX - | 
CLR# 2; SETDPA ---s "Q=DPA; because of # (no load) 


SP, remains unchanged. 


3.6.2 Double Operand Instructions. Choose one from each column. 


Destination 
Register 


Source 
Register 


Operation Shift 


MOV --- dst, 
ADD R 

SUB L 

AND RR 

OR 


The Operation is performed between the Source (sre) and Des- 
tination (dst) registers. If Bit Reverse (&) is specified the con- 
tents of Source are bit-reversed before being used. The Shift is 

performed on the result, which is then stored into the Destination 
Register, unless No Load (#) is specified. The shifted result is the 
S-Pad Function (SPFN), which may be stored into TMA, MA, or DPA, or 
placed onto the Data Pad Bus. Some examples: 


MOV 3,15 "SP, SP, 5 


ADDL 6,10; SETMA | " (SP, ,+SP,)*2 SP, ,>MA 
— SUB 7,13 | : ;  -"(8P,4-SP,) SP); 
AND#5,11; SETDPA | '"(SP;,AND SP;)+DPA 
ORF -&6,7; SETTMA . _ a (SP, OR SP, (Bit-reversed)) +TM 


MOVRR 2,2 "(SP )/4+SP5 


For purposes of program clarity, the assembler allows 
names to be given to the S-Pad registers. If register "PTR" 
is a pointer to an array in Data Memory, and register "STEP" 
contains the increment value we are using to step through 

array, then: 


. ADD STEP, PTR; SETMA. 


will advance our array pointer by the proper increment and 
fetch the next array element from memory. 


3.6.3 S-Pad Tests. The following conditional branches test. 
the S-Pad Function (SPFN). 


_ BR LOOP "Branch unconditionally to program location "LOOP" 
BEQ LOOP "Branch if SPFN=0 
BNE LOOP "Branch if SPFN#0 
BGE LOOP _ "Branch tf SPPN+0 
BGT LOOP "Branch if SPFN>0O 


The above branches test the S-Pad result from the imme- 
diately preceding AP instruction. Thus an S-Pad operation 
must be done one instruction cycle before it is desired to test 


the result: A loop counting example: 

1 DEC 2: “Decrement SP2 | 

2. BNE LOOP "Branch to "LOOP if SP2has not yet reached 
Pig "Zero. 


.. £est the scOntents Of: SP3to- see 1 it is between a lower 
limit contained in SP2 and an upper limit in SPs, i.e. see 
if oPs< SP3< SPy. . 


i. SUBR S52 et 
2. SUB 4,3; BGT SMALL "Too small, SP3<SP, 
Sv. BGY BIG "TOO big, SP3>SPy 


The branches are made relative to the current Program 
Source Address (PSA) with a 5-b't displacement value. This 
means that the branch. target ad: ess must be within -~20gto +17 
locations of the current instruction. . 


3.6.4 An example. Load Data Baa X with an array "A", with 
N elements starting at Main Data Memory location 721 5% 
"CTR" is in S-Pad register which wtll be used as a counter. 


ae CLR# CTR; SETDPA "Set DPA to 9g 

2. LDMA; DB=3721 . "Fetch the first element 

Sa: LDSPI CTR; DB=N _ "Initialize "CTR" to N 

4. LOOP: INCMA; DEC CTR. "Fetch next element, A,+! 

5. DPX <MD; "Store A, into DPX; , advance 
INCDPA; BNE LOOP "DPA and test counter. 


Below is a chart of the above loop, for N=3 elements. 


Inst. Memory Data Pad : S-Pad 

7 MA. MI * DPA O 1 2. "CTR" Test 

1, ---  --- 0 ieatliateatiinatatl --- 

or. S724. - see 0 goss oe 

3. ee 0 se Sea) aay 3 

4. voy ae Ot seowetae 3 

5. sete » Ae 3 0. Ao -- -- 2 true 

4, 3723 0 ===. 1 Ag -- -- 2 | 
ae — Al 1 Ao Al -- si true 

4. --- --- 2 Ag Al -- a 

°. --- A2 2 Ao Ai Az 0 false 


A generalization on the above example is to fetch array 
"A" From every Kth memory location. 
The increment is stored in S-Pad register "STEP', and the 
array pointer is stored in "PTR": . 


1 LDSPI STEP; DB=K ) “Initialize "STEP" to K 
2. CLR CTR; SET DPA "Set DPA to 0. 
Se 4 LDMA; DB=BASE "Fetch the first element, Ag 
4 - LDSPI CTR; DB=N "Initialize “CTR” to. N 
5. LOOP: ADD STEP,PTR; SETMA .'Advance memory pointer. Fetch 
BEQ DONE | " “next element, Ajt+t!. Test 
. ‘counter and jump out if 
done. 
6. DPX<MD; INCDPA . "Store. A;into- DPX; , advance DPA 
DEC CTR; BR LOOP . s Decrement "CTR" and jump 


, "back to LOOP. 
7.  DONE:--- ‘ 


SECTION 
INTERFACE 


This section describes the interface between the Host. 
computer and the AP-120B. The interface is composed of two 
basic parts, 1) a simulated front panel and 2) direct memory 
access control (DMA). The front panel allows the host computer 
to examine or modify the internal AP-120B registers, as well as 
to start and stop program execution. The DMA control . 
provides for block transfer of data from the host computer 
to the AP-120B, and vice versa. ; 


4,1 FRONT PANEL 

The AP-120B "Panel" is used for bootstrap operations 
(loading and starting programs) and for debugging of 
user software (inserting hardware breakpoints and examining 
and modifying AP-120B registers and memory). The "Panel" 
consists of three 16-bit registers which are under the control 
of the Host via the Host interface. The functioning of these. 
registers closely parallels that of the switches and lights on 
the console of a stand-alone computer. The Host can examine 
and/or set ‘these registers at any time, irrespective of the 
state of the AP-120B. ; 


4.1.1 Switch Register. The Switch Register (SWR) is used to. 
enter data and'addresses into the AP-120B. The SWR can be read 
and written by the Host computer. An executing AP-120B program 
can also read the switches. | 


4.1.2 Lights Register. The Lights Register (LITES) simulates 
front panel lights, and is used to display the contents of 
internal AP-120B registers. This register can only be read 

by the Host. The executing AP-120B program can set the Lights 
Register. . 


4.1.3 Function Register. The Function Register (FN) provides. 
front-panel control operations (start, stop, continue, etc.). 
It can be read or written by the Host. ~ x 


PANEL FUNCTION REGISTER FORMAT 


14 iS 
REGISTER SELECT 


When the AP-120B is running only the STOP and RESET panel functions 
are valid. The other panel functions can only be exercised after 
the AP-120B has Ree halted. 
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Function Register Bits 
Bit Mnemonic | Effect 


Bit 0 STOP /HALTED Stop AP-120B program execution upon com- 
. pletion «f the current instruction. When 

the Host reads the FN register this bit 
reflects the current state of the pro- 
cessor, i.e., it will be a "1"' if the 
AP- 120B is halted. NOTE: If the current 
instruction does a "SPIN" while waiting 
for I/O or memory, the STOP will not 
take effect until the spin condition is 
satisfied and the instruction completed. 


Bit 1. START ' .Start program execution at the address 
Specitied.1n SWR. 


Bit 2 CONT | Continue program execution at the 
instruction pointed at by PSA (Program 
Source Address). 


(oe 


Bit STEP Execute the instruction pointed at 
by PSA and then halt. Advance PSA to 


point. to the next instruction. 


Bit 4 RESET ~ $§top the AP-120B immediately. Clear 

| | S-Pad register 9, set SPFN to SPspp , 
clear the AP-120B status register, stop 
the host DMA (CTL bit 15 set to @) and 
clear Main Data Memory timing. 


Bit 5 EXAM Examine the register or memory selected 
~~ by the Register Select field. Display 
the portion selected by the WORD field 
in the Panel Display Register. 


Bit 6 DEP. pesesit the contents of the Switch Register 
. into the register or memory selected by 
the Register Select field. Deposit into 
the portion selected by the WORD field. 


Bit. e BREAK Enables hardware breakpointing if PSA, MA, 
| - or TMA is specified in the Register Select 
‘field. The breakpoint. causes the AP-120B 
to halt one instruction after any instruction 
where the contents of the selected register 
was equal to the Switch Register. Thus, if 
a breakpoint is specified with PSA selected 
the AP-120B will halt after executing the 
instruction at the program location set in 
the Switch Register. PSA will be pointing 
at the next micro-instruction in sequence. 
Ifa breakpoint is called for on MA or TMA, 
the AP-120B will halt after executing the 
instruction following the one that referenced 
the trapped memory location. PSA will point 
to the second following sequential instruction 
after the one that caused the breakpoint. 
Memory breakpoints aid in debugging those 
elusive errors that modify. memory unexpectedly. 
~—4-3~— 


> Mnemonic 


Bits 8-9 INC 


Effect 


Increment MA, TMA, or DPA following completion 
of the other specified panel functions. This. 
allows sequential memory locations to he 
examined or deposited into. | 


Value in Bits 8-9 Address Register 


Bits 10-1ll WORD 


wn tk © 


to be incremented 


None 

MA (Memory Address) 

DPA (Data Pad Address) 

TMA (Table Memory Address) 


Specifies which portion of a register is 
being examined or deposited into. 


Portion of Register Affected 


Value Set in <16-Bit 38-Bit —  64-Bit 


Bits 10-11... ~ ‘Register Register Register 
g ALL N/A Word J, Bits 9-15 
ue . N/A Exponent Bits Word 1, Bits 16-31 


3 


Sits 12-15 REG. SELECT 


OF:-to G9. Right 
justified in 16- 
bit. field. 
N/A High Mantissa “ord 2, Bits 32-47 
Bits @@ to ll. 
Right justified 
N/A . Low Mantissa Word 3, Bits 48-63 


Specifies which AP-120B internal register 
or memory location to examine or deposit 
int.O. | : BO 


Function: Register Bits Cont’ 


Octal Value set in 
Bits 12-15 
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Register 
or Memory 


Selected 


AP Status 
DA 


~ Host Data (EXAM only) 


Program Source Address 


_S-Pad Destination Address 


Main Data Address 


Table Memory Address 


Data Pad Address 
S-Pad Function (EXAM) 


S-Pad addressed by SPD (DEPOSIT) | 


AP-120B Internal Status Reg. 
Device Address Register 
Program Source Memory addressed 
by TMA 
- Reads 
output of Format Conversion Reg. : 
Control Buffer, Bits 48-63 
(EXAM only) 
Data Pad X addressed by (DPA-4) 
Data Pad Y addressed by (DPA-4) 
Main Data Memory addressed by MA. 
S-Pad Function (EXAM only) 
Table Memory addressed by TMA 
(EXAM only) . 


4.2 


4.2.1 


NOTES ON THE USE OF THE FRON. PANEL AND BREAKPOINT 


“Where does the AP stop on a breakpoint?— 


a) With the breakpoint set on PSA, the AP+120B will stop 
. with PSA pointing to the next instruction to be exe- 
cuted. 


Thus breaking on a branch instruction and then ex- 
.amining FSA will show whether the branch condition 
was true or false. 


b) With the breakpoint set on TMA the AP-120B will stop 
with PSA pointing to the second instruction following 
the one that set TMA to the break address. 


¢c) With the breakpoint. set on MA the AP-120B will stop on 
either the next instruction or the second instruction 
after the one that set MA’to the break address, depending 
on the state of the memory lockout hardware (next in- 
struction if memory lockout, second instruction if no 
memory lockout). | 


Thus the stopping point following an MA breakpoint 
will have a one instruction uncertainty. 


Does the instruction on which the AP stops execute? 


Since SPFN is current, it will be set to the operation 
specified in the instruction that PSA is pointing to. 
Otherwise, the instruction that PSA is pointing to 
remains unexecuted and will execute correctly when the 
user steps or proceeds from the a ac 


What about MD timing and lockout on a breakpoint in the 
middle of an MD memory cycle? 


a) The hardware has been designed so that the AP can be 
stopped in the middle of a memory cycle. The hard- 
ware remembers where the memory timing was when the 
AP stopped so that the processor can continue correctly 
from a breakpoint that occurs during a memory cycle. 


b) However, the user must not examine MD nor should 
there be any DMA transfers going to or from MD 
while the AP is stopped if the user wishes to 
proceed from the Dir esnostes 


Thus, for example, it is possible to break in the 
tight-to-memory portions of the FFT and examine 

DATA. PAD or the address registers (PSA, SPA, etc) 

and then proceed. But it is not possible to proceed 
if the user or the. host interface disturbs the memory 
timing by reading or writing MD or TH. 


ic 
.e)) 


4.2.4 Summary of the rules for proceeding from breakpoint. 


If the breakpoint causes the AP to stop in the middle 
of the memory cycle (PSA pointing to first or second 
instruction following SETMA, INCMA, DECMA or LDMA), the 
user should not try to examine or modify MD. 


4.2.5 What about stepping the AP? 


The same rules as for proceeding from a breakpoint 
apply to stepping the AP through a program. The user 
can examine and modify any register of memory within 
the constraints mentioned in 4.2.4 above. 


4.2.6 What other pitfalls are there in the use of the virtual 
front panel? 


a) Note that the panel always examines SPFN not SPospp: 

| Thus, if the user wishes to see SPopp he must 
fonee SPFN = SP spp - This can most easily be done 
via the panel reset. function which has the unhappy 
Side effect of also clearing SP(Q). 


~b) To examine TM, the user should first set TMA and 
then do a dummy panel operation (deposit TMA again 
for example) inorder to enter the output of table 
memory into the table memory buffer register. He can 
then proceed to examine the addressed location using 
the i sal panel functions. 


ec) MD 


Setting MA from the panel initiates an MD memory 
read cycle. Depositing into MD from the panel 
initiates an MD memory write cycle. 


Thus, to write MD and then examine what was just 

.written, the user must perform a deposit into MA 
operation (with the same address) to initiate a. 
read cycle before examining MD. 


d) Using the Increment field in the FN register. 
DPA and TMA always increment after the EXAM or DEP 
operation is complete (remember that TMA is used 
to address program source memory for pdnel operations). 


MA post-increments and initiates a new memory read 
cycle on an EXAM operation. 


“MA pre-increments on a DEP operation. 


1B 
1 4 
~) 


e) 


Starting the AP 
The recommended starting procedure is as follows: 


i) Set the SWR to the starting address and do a 
deposit into PSA 


ii) Set the SWR to the desired breakpoint and do a 


continue to start the AP-120B. 


This proesaiwe has the significant advantage that it 
places the necessary breakpoint code into the user's 


program should he need to debug his AP program. 


‘The panel START function can be used but the user 


should observe the following restrictions on the first 


instruction executed by the AP-120B: 


The first instruction should not branch or jump or 
modify PSA in any way other than to advance to the 
next instruction. The first instruction should not 
use the SPEC or IO fields. In fact, the preferred 
first instruction is a NOP (all zeros). 


4.3. DIRECT MEMORY ACCESS . 

In addition to the Panel func’ ion, the AP-120B contains 
four 16-bit registers that are use: for Direct Memory Access 
(DMA) to both Host and AP-120B epee memory plus a 38-bit 
Format Conversion Register that acts as buffer between the two 
memories. These registers may be read and/or loaded from either 
the Host computer or the AP-120B. 


4.3.1 Host Memory Address Register. The aout Memory epi wees 
(HMA) points to consecutive locations in.the memory of the Host 
Computer. It operates in either an auto-increment or auto- 
decrement mode during DMA transfers to and from Host memory. 
HMA is Device Address 1 for AP-120B internal I/O transfers. 


4.3.2 Word Count Register. The Word Count Register (WC) counts 
the number of Host memory words transferred in a DMA operation. 

It is preset to the desired number of words to be transferred and 
counts down as the transfer proceeds, stopping the DMA transfer 
when it reaches zero. Hardware logic prevents this register from 
being counted past zero. WC has AP-120B Device Address 9. 


4.3.3 AP Direct Memory Address Register. The AP-120B Memory 
Address Register (APDMA) points to consecutive locations in AP-120B 
Main Data Memory during DMA transfers to and from MD. This register 
can operate in either auto-increment or auto-decrement mode. APDMA 
has AP- 1208 Device Address 3. 


~ 4.3.4 Control Register. The Control Register (CTL) acts as a 

control over the DMA and interrupt functions of the Host Interface. 
This register controls the direction and mode of transfer (DMA or 
program control), the type of data format, and provides certain 

bits a status information peryefnene to the transfer. CTL has AP-120B 

Device Address 2. 


DMA CONTROL REGISTER FORMAT 


9 | 10 
ap {wrt |DEC |DEC|FM 
| DMA HOST |APMA| HMA 


5 6 
IH |FERR 


ae | WC 


All bits are read/write except as noted. 


Bit 0 WC=9 #Indicates that the Word Count Register is zero. Note 
that WC is decremented only during DMA Transfers to/from 
Host Memory (Read only bit). Should not be used to 
-monitor DMA activity. 


Bites -INTR AP Set the INTRQ (Interrupt Request) flag in the AP-120B. 


Bit 2 IAPWC Set INTRQ (Interrupt Request) flag in the AP-120B when 
the DMA transfer is done. 


Bit 3 IHALT Enable a Hest interrupt when the AP-120B halts. 


Bit 4 IHWC Enable a Host interrupt when the DMA transfer 
is done. | | 
Bit.'S IHENB Interrupt Host Enable. Interrupt Host if 


AP-120B attempts to set this Bit. This bit 
can actually be written only by the Host. 


Bit 6 . FERR Format Error. Indicates that exponent 
underflow or overflow occurred in conversion 
from AP-120B format to Host floating-point 
format. : cal & 


Bit 7 DLATE Data Late. Indicates that the AP-120B did 
not empty the format buffer before the 
- Host attempted to reload it. On some Hosts 
this bit also indicates an attempt to access 
non-existent Host memory. In either case 
the DMA transfer is terminated. 


Bits CC. Consecutive Cycle. Block DMA transfers 
to/from Host memory will occur without 
interruption. On typical Hosts, the Host 
CPU will be locked out but other higher 
priority DMA devices will still have access 
to Host memory. 


Bit 9 APDMA Allows the interface to perform DMA transfers 
to/from AP-120B memory. Depending on the 
direction of transfer, a Main Data memory 
cycle is initiated every time the Host 
finishes reading or loading the format 
register, whether via DMA or program control. 
On the AP-120B side, the format register is 
loaded from the Main Data Bus instead of the 
‘Data Pad Bus. | 


H Bat: 30 WRTHOST Write to Host. This bit controls the 
direction of transfer. If set, data is 
read from the AP-120B, passed through the 
format register, and written to the host. 
If clear, the direction of transfer is 


reversed. 
Bit 11 DECAPMA Decrement APMA. re Ser. APMA is decremented 
| - during DMA transfers to/from AP-120B Main 
Data memory. If clear, APMA is incremented. 
Bit: 12 DECHMA Decrement HMA. If set, HMA is decremented 


during DMA transfers to/from Host memory. 
If clear, HMA is incremented. 


Bits 13214 FuT 


Bit 15 


‘Value 
Bits 


Format Register Control. 


Note that the format register mode of 

operation is controlled entirely by hits 
9, 10, and 13 and 14 of the control register. 
Thus even though the Host and the AP-1208 


‘can load and read the Format Register via 


program control I/O transfers at any 

time, the programmer must be sure that the 
type of transfer he performs is consistent 
with these bits of CTL for the transfer 

to be meaningful. 


in 

13-14 Format Type 

ge 32-bit integer. No format conversion. 

ra, Used. to transfer integers or program 
half-words. 

ie 16-bit integer. 16-bit integers from Host 
are converted to unnormalized 38-bit AP-120B 
FPN's. Low 16-bits of AP-120B FPN are sent 
to Host. | 

2 Conversion of "Sign-magnitude mantissa with 

| binary exponent" format to/from AP~120B 
Floating Point format. Includes logic to 
handle "Phantom bit" formats. 

3 Conversion of IBM 32-bit format to/from 
AP=-120B format. IBM format can be specified 
to have either sign-magnitude or 2's comple- 
ment mantissa. 

Note: For format types 2 and 3, the format register 


HDMA Start / 
Busy | 


has the necessary logic to detect overflow 
and underflow on conversion from AP-120B 
format and to force a signed maximum quantity 
on overflow or oars zero on 
underflow. 


Host DMA Start. Initiate DMA transfers CO/ 


frcm Host memory. When read the state of 
this bit reflects the status of the Host DMA 
activity oC! ds tf active, 90" Ti anacrvive). 


Transfers continue. until NC=@, 


4.4 FORMAT CONVERSION REGISTER 


This 38-bit double-buffered register. is used for all 
transfers of floating-point numbers (FPN's) between the Host | 
and the AP-120B. It also provides the most efficient path for 
transfer of micro-code half-words (32-bits). It performs bi- 
directional format conversions under the direction of Bits 9, 
10, 13 and 14 of the CTL Register. The programmer must be aware. 
of the fact that the Format Conversion is a slave to these CTL. 
Bits. Transfers to and from the formatter must be consistent. 
with these bits or nonsense will result. The Host and AP-120B 
can read the output of the formatter at any time without re-_ 
striction, however, the input to the formatter is controlled 
by CTL bits 9 and 10. 


CTLOY CrLLoO: ~ Input Path to Formatters 


oe 208. Host Data Bus 
0 1 | AP-120B I/O Bus 
ni oil ~AP-120B Main Data Output 


The formatter has a ready indicator that can be sampled by the 
AP-120B. This indicator tells the AP-120B when to load new 
data into the formatter (CTL10=1) and when to read data from it 
(CTL10=@) after the Host has finished reading or loading the last 
(‘16-bit word of an FPN. 

Note that in 16- bit Host Computers, the interface expects 
to receive words in different order depending on CTL Bit 12 
(DECHMA). If Bit 12 is clear i.e., the Host DMA interface is 
going through memory in. forward order from low to high addresses, 
then the interface expects to receive the high word of an 
FPN followed by the low word. If bit 12 is set, the interface 
expects to receive the low word followed by the high word. This 
is done so that arrays of FPN's are always stored in forward 
order in Host memory. | | 

If the Format Bits of CTL (Bits 13,14) specify a 16-bit 
transfer (FMT=1) then the integer is loaded and read from the 
low word of the formatter. And that word is considered to be 
the last word transferred. . 
. There is no corresponding indicator to the Host since the 
AP-~120B can transfer data to and from the formatter faster than 
most HOST processors. The DLATE bit in the CTL Register (CTL 
Bit 7) does indicate when an error of this type occurs, i.e., 
when the Host transfers data faster than the AP-120B. 


> 35. AP-120B INTERNAL INTERFACE TO HOST INTERFACE 


- ‘The Registers in the Host interface are accessible to | 
the AP-120B via its Input/Output (1/0) instructions (FADD=7). 


AP— 120B Device Addresses for Host 
Pare eeece Registers 


Register DA - 


WC p 
HMA i 
CTG 2 
APMA 3 
FORMAT 4 


An IN, OUT or SNSA instruction at DA=4 (FORMAT) will 
generate an IODRDY Response if the FORMAT Register is ready 
to accept data from the AP-120B (CTL Bit. 19='1') or if it 
has formatted data readv for the AP-120B (CTL Bit 19='9'). 

If CTL Bit 9 is a '1', the AP-120B cannot load the formatter 
via I/O instructions since the input multiplexer to the 
FORMAT Register will be set to select MAIN DATA instead of 
the AP-120B I/O Bus. Note that the AP-120B cannot change the 
state of CTL Bit 5. If it attempts to set this bit when the 
bit has been previously set by the Host, an interrupt of the 
Host will be generated. The AP-120B can read the CTL at any. 
time without interfering with the HOST interface. If both 
the Host and the AP try to write CTL or access HMA, WC or 
APMA at the same time, the Host selection and data has prior- 
ity over that of the AP- 120B. 

| Access to the Format Conversion Register is ‘controlled 
by CTL Bits 9, 10, 13 and 14. See Section 4.3, Format Con- 
version Register, for a description of the function of these 
bits. | . 


4.6 AN EXAMPLE, LOADING PROC AMS INTO THE AP-120B 


Loading and running a program in the AP-120B from a "cold 
start" is a five step DEOcESS which nicely illustrates 
use of the front oe 


a Through the AP front panel from the host computer 
—  . we "finger switch" in a three instruction 
bootstrap program into Program Memory. 


2. - We start the bootstrap running. 


3. Set the address where we want the loaded program 
to go in the AP- 120B. 


4. We start with a DMA transfer of program words . 
from host computer memory to the AP-120B. The boot- 
strap program running in the AP-120B stores these 
words: into Program Memory. 


a. When the DMA transfer is done, we stop the boot- 
; strap program in the AP- 120B; and then restart 
the AP-120B executing our newly loaded program. 


These five steps are detailed below. DMA control and front 
panel interrogation is done from the host computer by 

setting various interface registers. The actual host 

computer I/O instructions to accomplish this, of course, 

depend upon the particular host computer. Section +) describes 
the proper 1/0 instructions for a number of host computers. 

For the purposes of this explanation, we merely indicate 

what numbers get loaded into which interface register in 
order LO accomplish the desired gon | 


Step 1 


For the purposes of this example, we are going to put 
_ the bootstrap program into Program Source Memory locations 
9, 1 and 2. 


ae Set TMA to @ (TMA is the pointer used by the 
panel functions for nent ns ue or Gopo ELne into 
Program Memory): 


«9 + SWR Put @ into the switches. 
1003 + FN | Put 1003 into the Function register 
| | (causing a deposit into TMA) 


2. Put bits 0-63 of bootstrap program program word 
#1 into Program Memory location 9 using four de- 
posits of SWR + PStma. 


(bits 0-15) * SWR Put bits 0-15 into the switches. 
LOLO = EN - Put 1010 into the function register 
| (causes a deposit into bits 
0-15 of eee 


4-14 


(bits 16-31) + SWR Puts bits 16-31 into bits 


1030 +FN VG=3t-O8 PSa as 
(bits 32-47) + SWR Puts bits 32-47 into 32-47 | 
‘1050 + FN SOL Swans 


(bits 48-63) > SWR Puts bits 48-63 into bits 
1370 + FN , 48-63 of PS»ya and increment 
| - TMA to point to location 1. 


3. Repeat number 2 above bootstrap program words : 
#2 and #3. 


The above sounds worse than it really is, and besides, we 
only have to do it once (unless we clobber the bootstrap 
turn off power). 


step 20 


Set the address in the AP-120B Program Memory where we 
want our program to get loaded by the bootstrap into TMA. 
Or this example, this address shall be 200: ; 


200 + SWR 3 Put 200 in the switches. 
1003 + FN Put 1003 into the function 
13 register (cause a deposit 
into TMA). 


Step 3 
Start the bootstrap program running in the AP-120B. 
1. Set the switches to 9 and do a start. 


Q + SWR 
40000 + FN Start the AP-120B at location @ 


The bootstrap program (as we shall see below) will sit 
and "spin'', waiting for words to come across the DMA from 
the host computer. . 


7 4 


Start the DMA transfer going from host memory into the AP-120B. 
For this example, we will assume that the program we wish to 
load is sitting in host memory at location 20000. We will 
further say that the program to be loaded is 200 AP program 
words (or 800 16-bit host words) long. The actual host memory 
location and length, could, of course, be any particular value. 


20000 + HMA. 7 Set host DMA address to 20000. 
800 > WC. 7 Set word count to 800 host 
words (assuming a 16-bit host 
| | | word width). 
204 + CTL ' Start the DMA going. 
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Note in particular, the "CTL" bits. Bit 15 starts things 
going and bit 8 peqnes consecutive memory: cycles from 

the host. By not setting bits 10 or 11 we set the transfer 
to go to the AP- 120B, but not into Main Data Memory. 
Instead, the data goes only as far as the formatter which 
our bootstrap will read. If we had set bit 4, the host 
computer would be interrupted when the DMA is done. As 
things are, we will keep rasa ‘simple and not use the 

host interrupt, 


Step 4.5 


And now, finally, we get to our three word bootstrap 
program PeBneee in the AP-120B: 


Q. LDDA; DB=4 | "S8t DEVICE ADDRESS to 4. 


This instruction sets the Device Address register, so that 
future I/O instructions will refer to device #4, which is 
the DMA TOPMALLCE (where the data from the host computer 
ends up). 


1. LOOP: SPININ: "Wait for some data 
, DB= INBS: 'Get the data 
LPSLT "Put it into the left half of P.s. 


The "SPININ" causes the processor to hang until the current 
I/O device address (in our case, the DMA formatter) has some 
new data; and then to read that data the "DB=INBS" puts the 
input data onto the Data Pad Bus. The "LPSLT" puts what is 
con the Data Pad Bus into the left half (bits 0-31) of the 
Program Memory location pointed at by the TMA register. 


Two points here: 1) The formatter is 32-bits wide on 

the AP-120B end. Every time the interface has gotten 
32-bits of data from the host computer the "SPIN" stops 
waiting and we have another 32-bits of data. Since the 
program words we are loading are 64-bits wide we get them 
in. Daives: (tert, right, tett, rien, etc.) and store them 
accordingly Lac Program Memory. 2) We used TMA as a 
pointer to where our bootstrap.would put the program it 
is loading, so the ."LPSLT". puts the proeten words into 
the DEORE” place. 


2. SPININ; | | "Wait for data « 


DB= INBS "Get the data .- 

-LPSRT; | "Put it into the ‘right half 
ENCTIA "Tnerement our pointer - 

“BR LOOP: "GO back LO more 


This does basically the same as #1 above, except that. here 
we have the right half (bits 32-63) of a 64-bit program 
word. The "INCTIMA™ increments our “storing” pointer 

so instruction #1 will store its data into the next word. 
The branch keeps using loop,. insatiably waiting for more 
program half-words. or a 4 
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‘Step 5 


Back in the host, we wait Hee the DMA ivaneror to be done 
by: 


1. Read ee? CTL register | 
2. Test for bit 15 (the LSB) equal to a 
3 If so, go back LO: Lis 


We could, of course, _ have enabled a host interrupt on DMA 
completion, . 


When DONE, we stop the bootstrap program (which otherwise 
would run forever) with a panel RESET function; and 

then start our newly loaded program, (our example starts at 
location 200): , 


40000+ FN RESET the AP-120B 


200 > SWR | . Our new program address 
1000 + FN Set 200 into PSA 


20000+ FN | Continue (from 200), i.e.,.start 
* | at A.P. location 200 


Had we wished to set a program breakpoint, we would have 
set the breakpoint address into the SWR and used 20400 
(continue +t break on PSA) for our final panel function. 


Postscript 


The simplest way for the AP-120B program that we have now 
set running to indicate to the host computer that it is 

done with its task is to HALT. When this happens bit @% in 
the Panel Function Register will come on, which the host 

can test for; or a host interrupt can be enabled (CTL bit 3). 


SECTION 5 


- PARTICULAR HOST INTERFACES 


, This section describes the particulars of several 

host computer / AP-120B interfaces. in. general, the inter= 
face consists of eight registers which are accessable to 
either the host computer or the AP-120B. Three of these . 
registers comprise the AP-120B "front panel", while the 
other five control the DMA (Direct Memroy Access) connection 
between the two processors. 
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Figure 5.1 General AP-120B/Host Computer. Interfaces 


Ths host computer controls the AP-120B by loading and 
reading these eight registers Thus, the AP-120B may 
be likened to a "smart" disc or tape unit, responding 
to programmed I/O instructions from the host to initiate 
desired tasks. eS 4 


The exact effects and significance of these eight 
registers are described in detail in Section 4 of this 
manual, which explores the AP-120B side of the inter- 
face. This section details how particular host computers 
access the interface registers, and hence control the 
-AP-120B. . 


9.2 DEC PDP-11. INTERFACE 
5.2.1 Interface Register Address: 


Unibus Address Interface Register 


AP+100 | Word Count (WC) 

AP+102 Host Memory Address (HMA) 

AP+104 DMA Control (CTL) - 

AP+106 AP Memory Address (APMA) 

AP+110 | Panel Switches (SWR) | 

APrili2 Panel Function (FN) 

AP+114 Panel Lites (LITES) 

AP+116 | ‘Reset (Same as Panel Reset Function) 
APs . Pormatter . 


The base. address "AD" is strappable between 174000 and 

LTT60G 

5.2.2 Comments. HMA is an a bit, even byte address that 

' increments or decrements by 2. WC counts 16-bit PDP-1l 
word transfers. — 


Word format (CTL bits 13-14) type 2 refers to ‘PDp- ll Fortran 
format (8-bit biased binary exponent, sign-magnitude mantissa 
with "hidden" nee Format type 3 refers to IBM 360 32-bit © 
format. | 


In consecutive cycle mode (CTL bit 08 set) the PDP-1l 
DMA interface will steal a hardware selectable number (1 to 15) 
of consecutive nee cycles and then release the Bus. 


This-is done so that other time- CPLeEeee DMA devices can 
get access to the bus. 


5.2.3 Reading/Writing of Host Memory Address. The two high 
order bits of the Host Memory Address (HMA) register bits 


are set via an output to the Panel Lights (LITES) address, and . 
read with an aPoue from the Reset Address: 


MOV HIGHBITS, AP+114 (Move to LITES) — MOV pence tds AP+102 (Move to HMA) Gia 
; oading 


Ba = 
11] LO} 9) 81 7} 615}473) 241 From 
the 


15 ia| 13} 22/11) 
i PDP-11 
ran | 


; cae ot 
15} 14) 13}12] Li} LO} 9} 8) 7] 6} 5] 4) 3) 2-1] O| - Host Memory Address 
: bon | (HMA) 


14; 13 12 


Reading 
15{ 14/13| 12] 12 19] HMA 
ante 
: : : che 
3 (2 fr PDP-11 
Mov AP+116, HIGHBITS (Move from RESET) MOV AP+102, LOWBITS (ove from a) DP-1 


NOTE: The move to the LITES aldress (AP+114) does not affect 
the LITES register. Similarly, a move from the RESET address 
(AP+116) does not cause a Reset of the AP-120B. 


An example: we wish to set HMA.to 532706: 


MOV #132706, APt102. ; SET 12706 into HMA 
MOV #100000, AP+114 ; ADD 400000 into HMA 


and to read out of HMA this value we have set: 


MOV AP+102, LOWBITS ; would get 12706 
MOV AP+116, HIGHBITS ; would get 137777 


9.2.4 Program Access of the Format Register. The AP-120B 
Formatter is 32-bits wide, and hence is accessed by the 
PDP-11. into 16-bit halves. The Formatter may be read/written 
at any even address between "AP" and AP+76. If DMA control 
(CTL) register bit 12 is clear (host address increments). 


successive high, low, high, low,... halves of the Formatter 
are accesded. If CTL bit 12 is set (host address decrements) 
the sequence is low, high, low, high... In both cases, high 


refers to the first 16 bits of the floating-point ntmber 
of 32 bit integer, and low refers to the last 16 bits. 


When 16-bit integers are involved (CTL bits 13-14 set to 
"Ol") only 16-bit entities are involved on the PDP-~11l end. 
The access then is simply by words: word #1, word #2, word #73... 
5.2.5 Examples. 


1) Stop the AP-120B and examine PSA (Program Address); 


- MOV #100000, AP+112 ; (STOP the AP-120B) to FN 
MOV #2000, AP+112 © -; (EXAM PSA) to FN 
MOV AP+114,. APSA -. 5 Get the answer (from the LITES) 


2) ‘Start the AP-120B at program location 20, setting 
a breakpoint at PSA=200: 


MOV #20, AP+110 ; Put 20 into SWR 
MOV #1000, AP+112 ; (SWR + PSA) to FN 
‘MOV #200, AP+110 . ; 200 to SWR . 
MOV #20400, AP+112 ; (CONTINUE + BREAK on PSA) to FN 


3) Initiate a DMA transfer of 100 PDP-11l format 
floating-point numbers from location 20000 in the 
‘PDP-11 to location 2000 in the AP- 120B Main Data 
Memory: 


MOV #200, AP+100 =; Set WC to 200 PDP-11 words 
MOV #20000, AP+102 Set HMA to 20000 

MOV #2000, AP+104 ‘Set APMA to 2000 

MOV #305, AP+106 | Start the DMA 


wewe wow 
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5.3 DATA GENERAL NOVA/ECLIPS! INTERFACE 
53k I/O Instruction Assignments: 


[I/O Instruction — Interface Register 


DOA/DIA AP1 ~ Word Count (WC) 
DOB/DIB AP1 Host Memory Address (HMA) 
DOB/DIB AP1 DMA Control (CTL) 
DOA/DIA AP2 : AP-120B Memory Address (APMA) 
DOB/DIB AP2 Formatter high word (FMTOO-15) 
~ DOC/DIC AP2 Formatter low word (FMT16-31) 
DOA/DIA APG - Panel Switches (SWR) 
DOB/DIB AP@ Panel Function (FN) 
/DIC APO Panel Lights (LITES) 
Other I/O Instructions Effect 
NIOC AP@ Clear AP eae pecuese 
NIOS APO Reset (Same as RESET panel function) 
MSKO — Set interrupt mask (bits normal) 
SKPBZ AP? . . Skip if AP halted 
SKPDN AP@ a Skip if SREenEURe request penctne 


Responds ath Device Code "AP@" to INTA from the Nowa, 


Device codes APg, AP1, and AP2 must lie within a single 
octade of device codes. 


5.3.2 Comments. HMA increments and decrements and WC 
decrements, for each 16-bit Nova word and transfer. 


Word Format (CTL bits 13-14) type 3 is Nova single- 
precision floating-point format (32-bit IBM sign- : 
magnitude mantissa, 7-bit hex exponent). Format type 
2 is not used. 

51345 ‘Examples: 
1) Stop the AP-120B and examine PSA (Program Address): 


LDA @, 7100000 


DOB 9, AP ; (STOP the AP-120B) to FN 

LDA 9, =200 oe | 

DOB 9, APO ; (EXAM PSA) to FN 

DIC 1, APQ 3: get the answer (in LITES), into AC1 


ZY Start the AP-120B at vrogram location 20, setting 
a breakpoint at PSA=200: 


DOA @, AP ; Put 20 into SWR 

LDA 9, =1000 . 

DOB @, APO . ; (SWR. + PSA) to FN 

LDA @,. =400 . 3 

DOA @, APO. . ; Put 200 into SWR 

LDA 9, 729400 . | | 

DOB @, AP® ; (CONTINUE + BREAK on PSA) to FN 
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3) 


Initiate a DMA transfer of 100 Nova floating-point 
numbers from location 20000 in the Nova to location 
2000 in AP-120B Main Data Memory: 


LDA 9, -=200 
DOA @. AP1 | > Set WC to 200 Nova words 
LDA $9, =20000 | 
DOB @, API ; Set HMA to 20000 
' LDA @, =2000 oe . 
DOA @, AP2 | ; Set APMA to 2000 
LDA 9, =306 | 
DOB 9%, AP1l ; Start the DMA 


5.4 RAYTHEON R704 1RDS-500 INTERFACE 


The AP-~120B RDS 500 interface acts as a controller 
for the data transfers between the AP-120B interface registers 
and the RDS 500 data busses, as described in Section 4 of 
the AP-120B's Processor Handbook. The interface consists 
of three units: the Direct I/O controller, the DMA interface, 
and the data transfer receiver. 


The DIO control recognizes the Pov iewias DOT/DIN | 
commands: 1 


DOT/DIN Function | | | 
Assignment Field Access Register 


il 7 - Panel Switch Register (SWR) 
2 a Panel Function Register (FN) 
3 - an -. Panel Lights Register (LITES) 


(Read Only) 
| DMA Word Count Register (WC) 


Host Memory Address Register (HMA) 


4 
5 
6 | | ~ Control Register (CTL) 
7 ~ AP Memory Address Register (APMA ) © 
8, A,C,E . | Format High | 
9, B,D,F Format ew 
DINO aa | | Read Simple Status Word 
DOTO Reset | 
Sete that the Panel Display Register (PDR or LITES) can only 
be read by the host. Attempting to load this register results 
in-a no-op in the DIO control. 
Executing a DIN instruction with the function assignment field 
equal to zero reads a status word on to the DIN bus, true status 


is indicated by a one in the appropriate position. The status 
bit eso teament is as follows: 


DIN JBAt- « . Assigned Status 
| Q 4 : AP running 
al 8 | ap DMA request present 
15 3 3? - Error (Data late or Format 


Conversion Error) 


2-14 a Unused 


Bit 15 is the result of an inclusive OR of control 
register (see Section 4.2) bits 6 (format error) and 7 (data 
late). Bits 2-14 are not used and are always zero when the 
status word is read. | 


| Dot 9 is used as a reset command. Executing this 
instruction clears DMA timing and pending requests. | 


The device address that the AP-120B DIO control will 
respond to is selectable by a hexadecimal DIP switch mounted 
on the interface card. Any address from 9 - 15 present o 
DAD lines 8 - 11 can be selected. ; : 


DMA Operation is a slave to the state of the control 
register. Loading the control register with bit 15 set 
(HDMA start) initiates DMA transfers with mode and direction 
determined by CTRL bits 8 - 14. Bit 8 (consecutive cycle) 
selects the burst mode. When set, DMA transfers occur 
in blocks of up to 16 consecutive cycles, after which the 
DMA request line is released for one cycle to allow other 
devices access to the DMA channels. The actual number of 


- eyceles, (executed before the request line is freed) is selectable 


by a hexadecimal switch located on the interface-card. When 

the consecutive cycle bit is clear, the request line is released 
for one cycle after executing one transfer. DMA transfers continue 
until the word count register reads zero. The word count is 

loaded with the actual number of 16-bit words to be transferred. 

If loaded with zero, however, one 16-bit transfer will occur. 


Jumpers for the memory request (MRQ1 — MRQ8), memory 
acknowledge (MAK1 -—- MAK8), memory write (MTW1 - MTW8), and inter- 
rupt request (IRPT @@ - IRPT 15) lines are located on the 
AP connector panel. Selection of the desired priority is made. 
by placing jumper plugs in the appropriate positions. The 
connector panel is clearly labeled and is visible from the 
rear of the unit. . 


The interface is equipped with a transceiver capable 
of accepting data in four different formats (outlined in 
Section 4.2 Direct Memory Access). Since the control register 


bits 13 and 14 specify the format to be used when loading the 


formatter under both DMA control and DIO functions 8 and 9, 
data transfers must be consistant with the state of these 
bits or nonsense will be output from the formatter. Data 
transfers that do not use the formatter will not be affected 
by the state of the control register (DIO funations 9 to 7). 


Caution must be exercised when inputting floating-point 
numbers (FPN). In IBM 32-bit format, the high part of the 
formatter (DIO function = 8) refers to the sign, exponent, 
and high mantissa of the FPN, while the low part (DIO function = 9) 
refers to the low mantissa (see figure below). As an example, 

a DOT with a function assignment field of 8 would load the sign, 
exponent and high mantissa of the formatter. On the other hand, 
when in Raytheon real two-word floating-point number format, 
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high format refers to the mar!issa least Significant bits 
and the exponent, while the i-.v word of the format is © 
taken to mean the sign and mantissa MSB's. In this case 
then, a DOT with function assignment of 8 would load the 
format mantissa LSB's and exponent. 


FMT HIGH —— Mantissa LSBs 


Raytheon Real 
Two-Word 


FMT LOW 


Mantissa MSBs 
FMT HIGH. | Exponent Mantissa MSBs | 
: _| IBM 32 bit floating 
FMT LOW Mantissa LSBs point 


32-bit integer format is a straightforward load of the high 
Significant bits into FMT HIGH and low significant bits into FMT 
LOW. 16-bit integers are loaded into the FMT. LOW. 


5.5 TEXAS INSTRUMENTS TI980 INTERFACE 


The AP-120B to TI980 interface acts as a controller for the. 
data transfers between the AP-120B interface registers and the 
TI980 as described in Section 4 of the AP-120B Processor Hand- 
book. The interface consists of three units: the Auxiliary | 
Processor Port Controller (APP Controller), the Direct Memory 
Access Channel peter rece (DMAC Interface), and the Data Trans- 
ceiver. . 


5.1.1 APP Controller. The TI980 Auxiliary Processor Initiate 
Instruction (API Instruction) format. is: 


0 7 78 ; 15 

| DD16 _ DEV FIELDS 

0 | 15 
API-2 | DEV FIELD | 


The first eight-bits of the API-1 word are decoded by the 
TI980 as the first word of an API Instruction. This causes the 
TI980 to send this instruction word, and the next sequential 
word in the TI980's central memory to the AP- aan through the 
APP, 


After API-1 has been sent through the APP its other eight © 
bits will be looked upon as having two separate fields. Bits 8,9, 
.and 10 are labeled DC in the diagram below and used as a device 
code. Bits 11 through 15 are labeled IC and used as the AP-120B 
APP Controller's instruction code. The APP Controller will com- 
pare the device code contained in API-1 to.the device code it 
has been wired to accept. The IC field will be taken as an 
instruction for execution by the APP Controller only if the DC 
field matches the AP-120B's APP Controller device code. 


: APT-~2 is labeled DATA ADDRESS in the diagram below and is 
used for this purpose if the DC field in API-1 matches the device 
code of the AP-120B's APP Controller. DATA ADDRESS is program 
relative when the TI980 is operating in User Mode and absolute 
when the TI980 is operaetne in ope tenor Mode. 
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The AP- 120B's APP Controller may be wired to accept any 
device code of 0 through 7. 


3G) .. LO”, Ud 15 


wes [ DATA ADDRESS iu 


DC-APP Controller Haeies Code. The AP-120B's APP Controller 
can be strapped to user- -specified codes of 0 bhrougs ree 


~ IC-AAP Controller Instruction Code. 


5.5.2 APP Controller Instructions. Each of the AP-120B's 
Controller Instructions is described below. A summary is 
given after the description of the last instruction. re 
numbers are given in radix 16 notation. 


Load Switch Register 


LSWR (IC=1) 
DATA. ADDRESS 


The AP-120B's Switeh Register (see AP~120B Beeueegn 
Handbook 4.1.1) is loaded from the TI980 central memory at loca- 
tion DATA ADDRESS. 


Load Function Register 

LFN (IC=2) 

- DATA ADDRESS 

The AP-120B's Function Register (see AP-120B Processor 


Handbook 4,1.3) is loaded from the TI980 central memory at 
location DATA ADDRESS, 


Load Word Count Register 


LWC (IC=4) | 
DATA ADDRESS 


The AP-120B's Word Count negates (see AP- 120B Processor 
Handbook 4.2.2) is loaded from the TI980 central memory at 
location DATA BEDRESSs -~11 


Load Host Memory Address ‘agister 


LHMA (IC=5) 
DATA ADDRESS 


The AP-120B's Host Memory Address Register (see AP-120B 
Processor Handbook 4.2.1) is loaded from the TI980 central 
memory at location DATA ADDRESS, When using this instruction 
TI980 central memory location DATA ADDRESS should contain the 
absolute address in the TI980's central memory from which DMA 
transfers are to be made. | 


Load Control Register 


-LCTL (1C=6) 
DATA ADDRESS 


| - The AP-120B's Control Register (see AP-~120B Processor 
Handbook 4.2.4) is loaded from the TI980 central memory at 
location DATA ADDRESS, | 


Load AP Direct Memory Address Register 


LAPMA (IC=7) 
DATA ADDRESS 


The AP-120B's Direct Memory Address Register (see AP-120B 
Processor Handbook 4.2.3) is loaded from the TI980 central memory 
at location pata ADDRESS, | | 


Load Format Conversion Register High 


LFMTH (IC=8) 
DATA ADDRESS 


The most significant 16-bit TI980 computer word of the 
AP-120B's Format Conversion Register (see AP+120B Processor 
Handbook 4,3) is loaded from the TI980 central memory at 
location DATA ADDRESS. This path is intended fer diagnostic 
use only, it is the programmer's responsibility to see that 
APP Controller and DMAC Interface data transfers do not 
contlict. | : 7 | | 


Load -Format Conversion Register Low 


LFMTL (IC=9) 
DATA ADDRESS 


The least significant 16-bit TI980 computer word of the 
AP-120B's Format Conversion Register (see AP-120B Processor 
Handbook 4.3) is loaded from the TI980 central memory at 
location DATA ADDRESS, This path is intended for diagnostic 
use only, it is the programmer's responsibility to see that 
APP Controller and DMAC Interface data transfers do not 
conflict. 


Load Host Memory Address Register Biased 


LHMAD (IC=D) 
DATA ADDRESS 


The AP-120B's Host Memory Address Register (see AP-120B 
Processor Handbook 4.2.1) is loaded from the TI980 central 
memory at location DATA ADDRESS. When using this instruction 
TI980 central memory location DATA ADDRESS should contain 
the program relative address in the TI980's central memory from 
which DMA transfers are to be made. 


Reset 


RESET (IC=10) 
DATA ADDRESS 


The AP-120B's DMA Timing and Memory Timing are Reset. 
The AP-120B's S~Pad Register O and Status Register are cleared. 
The RUN BIT in the AP-~-120B's Function Register is Reset. Ones 
are loaded into the TI980 central memory location DATA ADDRESS. 
This instruction performs the PuneL TOG of an AP— 120B machine 
reset. 


O1 


HLS 


Read Swrteh Reepst Cr 


RSWR ( TC=11) 
DATA ADDRESS 


The contents of the AP-120B's Switch Register (2G AP-120B 
Processor. Handbook: 444131 ) is eared inte the TLOSso0: central 
momoryv at location DATA ADDRESS, 


mead Furclion. Rewister 


REN. CECo 2) 


baad 


DATA ADDRESS 


The contents of the AP-120B's Function Register (soe 
AP-129B Processor Handbook. 4.1.3) is stored into the TIOSO 
Central Menory. 26° l6ocaricn DAarih ADDRESS, 


Read Panel Display Reeister (LITES) 


APD. (eC 


=13) 
DATA ADDRES 


) 
5 


The contents of the AP-120B's Panel Display. Becister 


(see AP-120B Processor Handbook 4,1.2) is stored into the 
Piss0 central memory at location DATA. ADDRESS. - There it< no 
Instruction for loading the Panel Display. Register from the 


TIVSO'S contral memory. 


+ 


Read Word Count Register: 


RNC (IC=14). 
DATA ADDRESS 


The contents of the AP-120B's Word Count Register (see 


AP-120B Processor Handbook 4.2.2) is stored into the TI980 
central memory at location DATA ADDRESS, - 


Read Host Memory Address Rovister 


RHUMA (TC=15) 
DATA ADDRESS” 


The contents of the AP-120B's Host Memory Address Register 
(see AP-120B Processor Handbook -1.2.1) is stored into the 


~TIO8O central nomery at 1oeatianw DATA ADDRESS. 
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Read Control Register 


 RCTL (IC=16) 
DATA ADDRESS 


The contents of the AP-120B's Control Register (see 
AP-120B Processor Handbook 4.2.1) is stored into the TI980 
central memory at location DATA ADDRESS. 


Read AP Direct Memory Address Register 


RAPMA (IC=17) 
“DATA ADDRESS 


The contents of the AP-120B's Direct Wnox Address 
Register. (see. AP-120B Processor Handbook 4.2.3) is loaded from 
the TI980 central memory at location DATA ADDRESS, 


Read Format Conversion Register High 


RFMTH (IC=18) 
DATA ADDRESS 


The contents of the most significant 16-bit TI980 computer 
word of the AP-120B's Format Conversion Register (see AP-120B 
Processor Handbook 4.3) is stored into the TI980 central memory 
at location DATA ADDRESS. This path is intended for diagnostic 
use only, it is the programmer's responsibility to see that 
APP PeREe Ones and DMAC Interface data transfers do not conflict. 


Read Format Conversion Register Low 


RFMTL 
DATA ADDRESS 


The contents of the least significant 16-bit TI980 computer 
word of the AP-120B's Format Conversion Register (see AP-120B © 
Processor Handbook 4.3) is stored into the TI980 central memory 
at location DATA ADDRESS. This path is intended for diagnostic 
use only, it is the programmer's responsibility to see the 
APP Controller and DMAC Interface data transfers do not coniiict, 


5.5.3 Summary of APP Instruction 


Instruction Instruction Instruction 


. Mnemonic Code (1Cig) Function Notes 
LSWR | 1 (DA) + SWR 
LEN | 8 (DA) + FN 
-Lwe 3 4 (DA) + WC 
LHMA = 3 (DA) + HMA. Absolute Memory Address 
ECT: 6 (DA) + CTL 
LAPMA 7 (DA) + APMA 
LFMTH 8 (DA) + FMTH Diagnostic Path 
 LPMTL 9 (DA) + FMTL Diagnostic Path 
LHMAB D (DA) + HMA —- Program Relative Address 
RESET Pt “FFFF + DA -AP-120B Machine Reset 
RSWR 11 (wR) DA 
RFN 1 (FN) + DA 
RPDR | | 13° (PDR)+.DA 
RWD | 14 (WC) + DA 
RHMA 45. (HMA)> DA 
RCTL 16s (CTL) DA 
RAPMA ; 17  ~. . (APMA)> DA 
RFMTH i (atsti‘éi~d‘B | (FMTH)* DA Diagnostic Path 
RFMTL 19 (FMTL)+ DA Diagnostic Path 
NOTES: 
(1) Da A location in TI980's central memory whose 


address is the 2nd word of an. API Instruction. | 
(2) ( ). -The contents of 
(3) = Stored into 
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oes DMA Interface. DMA transfers to and from the AP-120B 
are ere is by the contents of the four DMA registers in the 
AP-120B interface (see the Array Processor Handbook, Section 
4.2). The initialization procedure for a DMA transfer calls for 


loading these registers via eee taTy Processor Initiate 
Instructions. 


DMA operation is 4 slave to vas” state of the Control Register 
CCT Mode and direction of the transfer is determined by 
Control bits 8-14. Bit 8 selects the consecutive cycle mode. 
“When set, DMA transfers occur on consecutive host memory cycles, 
locking out the host CPU for the length of the transfer. When 
clear, the access request is dropped for one cycle after each 
word is transferred to allow the CPU access to memory. DMA 
transfers occur until the Word Count (WC) Register reaches 
zero. The WC Register is loaded with the number of 16-bit 
words to be passed. Setting bit-15 of the CTL Register 
(HDMA start.) starts the transfer. Because of this, the CTL 
register is always loaded Lest. 


Jumpers for access ‘request (ARDEV, O-7), access granted (AG,0-7), 
interrupt request (INTDEV,0-7), and interrupt acknowledge (IRECDSG, 
0-7), are located on the AP-120B DMA buffer cards. Selection 
of the desired priority is made by placing jumper plugs in the 
appropriate positions. 


AP-120B generated interrupts are controlled by CTL register 
‘bits 3-5. When the interrupt condition arises an interrupt 
request is generated by the interface. A status word is 
stored via a normal DMA transfer after the interface receives 
interrupt recognition from the host. The Status Word Format 
is shown below. The conditions are true when the respective 
“pits are set to one. 


bit-0 (MSB) - Word Count equals 0, Indicates 
DMA transfer completed. 


bit 14 — DMA transfer error 


bit 15 (LSB)- AP-120B is halted. Micro-code execution 
—- is terminated 


bits 1-13 ~ always zero 
The address where the status word will be stored in the host 


memory is selectable by ae options on the AP-120B ‘nteneaos 
card, . 
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Or 


5.5.5 Data Transceiver. The Data Transceiver is capable of 
accapting data in four differen: formats. Format selection. 
as accomplished by bit 13 and 14 of the AP-120B's Control 
Reyister (See AP-120B Processor Handbook 4.2. 4). When loading 
the Format Conversion Register of the Data Transceiver from 
the DMAC or APP data transfers must be consistent with the 
state of bits 13 and 14 of the AP-~120B's Control Register or 
NONSENSE will be output from the’ Data Transceiver. 


In 32-Bit Integer Format (CORT Bits 13 and 15=0) the high 
parts of the Format Conversion Register (APP Controller 
Instruction is IC=8/18) refers to the TI980 data word of high 
Significance. The low part of the Format Conversion Register 
(APP Controller Instruction IC=9/19) refers to the TII80 data 
word of low Syeutenesnce.: : 


In 16-Bit. Integer Format (CRT Bits 13 and 15=1) TI980 
data words are loaded into or read from the low part of the 
Forward Conversion Register (APP Controller Instruction is IC=9/19). 


In TI 2-Word Format (CTR Bits 13 and 15=2) the. high part 
of the Format Conversion Register (APP Controller Instruction: is 
IC=8/18) refers to the sign, and the high mantissa bits of the 
floating-point number. The low part of the Format Conversion 
Register (APP Controller Instruction is IC=9/19) refers to the 
low mantissa bits, the exponent bias bit, and the exponent of 
the floating-point number. 


Example: | 
TI 2-Word Floating-Point Number 
ee OI. : 15 a 
FMTH | “MANTISSA MSB iz 
| | oe i 
| oO. 739 15 
FUTL | wavrrssa‘ise| EB 


EXPONENT 


| Ee 


In IBM 32- Bit fosmat CCTR: Bits: 13: and. 152 3) the high part 
of the Format Conversion Register (APP Controller Instruction is 
IC=8/18) refers to the sign, exponent in biased hexadecimal, 


and high mantissa bits of the floating-point number, The low 


part of the Format Conversion Register (ADD Controller Instruc- 
tion is IC=9/19) refers to ERS low mantissa bits of vue floating— 
point number. . 
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Example: 
IBM 32-Bit Floating-Point Number 
oa. 7 8 15 


FMTH |S uc EXPONENT .  MANTISSA MSB 
{ ; ; ¢ 
FMTL . = | MANTISSA LSB's _ | 
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5.6 VARIAN 620 INTERFACE 


The AP-120B to Varian 620 interface acts as a controller 
for data transfers between the \P-120B interface registers 
(Section 4 of the AP-120B Processor Handbook) and the Varian. 
620 "E" Bus. The standard interface is capable of transferring 
data under program control and also under control of the Buffer 
Interface Controller (BIC). 


Siok Input/Output Commands. Program Control access to the 
AP-120B required 9 device addresses. These addresses can be 
altered via hardware straps in the interface. They must how-_ 
ever, all be included within two octades (not necessarily con- 
tiguous) of device addresses. The standard device addresses are 
listed in the table below. ; - . 


Device | Register 

Address Accessed 
X7 Switch Register (SR) 

_ X6 Functional Register (FN) 
xo . Panel Display Register (LITES) Read only 
X4 . Word Count . 
X3 . Host Memory Address (HMA) 

' K2 Control (CTL) 

- XL AP-120B Memory Address (APMA) 
= Se, Format Register Low (FMTL) 
Y7 : Format Register High (FMTH) 


Where X and Y indicate the first octal. digit of the Device 
Address. | 


Note that the HMA Register is not functional when using the BIC. 
The BIC Initial Register provides the V620 memory address for 
BIC transfers. , 


5.6.2 SEC Commands. Two SENSE instructions are provided to 
allow the V620 program to test the state of the AP-120B. 


Function 

Field . Result | 
0 | Branch if AP-120B running | 2 
1 -Branch if AP-120B to BIC interface is 


active - 


These sense instructions use the same device address as does 
the AP- 120B Switch Register (SR). 


Thus, the ‘following instruction: 

LOOP, SEN, 0100+SR, LOOP - 
will wait for the AP-120B to complete a BIC transfer. 
+5, 6.3 EXC Commands. Five external control commands are pro- 
vided for initialization and control of the interrupts and the 
BIC. The Switch Register (SR) device address is used for these 
EXC instructions. 


maneries Control 


Field Function | 
re Reset AP-120B. Clears interface and 


memory timing... Stops AP-120B processor. 


1 | Enable the AP-120B BIC interface in the 
. - Word Count Stop Mode. In this mode the 
interface will stop the BIC transfer when 
the AP-120B WC Register reaches zero. 


20 . Enable the AP-120B BIC interface. Fol- 
| lowing this command the BIC transfer 
will terminate in the normal fashion, 
i.e., when the initial and final regis- 
ters are equal. | 


‘8 | Unconditional stop of the BIC transfer. 


4 Clear the AP-120B interrupt. Used to 
clear the interrupting condition after 
the software has responded to an AP~120B 
interrupt. 


5.6.4 BIC Operation. To use the BIC with the AP- 120B, the 

program must first initialize the BIC in the usual way by loading 
its initial and final registers and by issuing the activate BIC 
EXC instruction. The final register can be set to 0 if the trans- 
fer is to take place in the Word Count Stop Mode. The program then 
selects the interface mode with an | 


EXC ,100 + SR »Word Count Stop. 
or: an EXC , 200 + SR ,Normal BIC Stop 


Finally, the program starts the AP- 120B interface transferring 

by loading the CTL register with the appropriate command for the. 
type of transfer desired. Note that the consecutive cycle bit in 
the Control Register has no effect on the transfer. All transfers 
proceed at BIC “ePese: 


5.6.5 Floating Point Formats. The AP-120B to V620 interface 
can transfer floating-point arguments to and from the V620 in. 
two floating-point formats. Tbh> format desired is specified 
by the format select field of the CTL Register. ) 


EME on. 4 Format | 
SEL Description . 
20 — High word 15 14 (7.6 0 
| | S EXP MANT 
Low word 15 


po | MANTISSA | 


This is the V620 format, the exponent is 
biased, and the first word is one's © 
complemented if the mantissa is negative. 


a High word 15. 14. 2 8 7. 0 


MANT 


Low word. 


| MANTISSA _ | | 


This is the IBM-360 short form format. 
The exponent is biased and is a power of 
16. The mantissa is in sign-magnitude 
TOrm, 


2,606.6 Interrupts. A jumper is provided in the interface to. 
allow strapping any interrupt level from 0 to 7. 


5.6.7 Physical. The interface consists of a signal DM135 type 
card which mounts in the V620 mainframe or expansion chassis, 
and which is connected to the AP-120B via a cable of up to 10 
feet in length. . 


APPENDICES 
APPENDIX A: AP~120B REGISTERS/DATA PATH NAMES 


Mnemonic Width Name 


SP 16 bits Scratch Pad Registers (16) 
SPD 4 S-Pad Destination Address Register 
SPFN 16 Scratch Pad ALU/shifter function output 
PNBLS 16 Panel Bus 
SWR 16 Panel Switch Register 
LITES 16. Panel Display Register 
APSTATUS 16 AP Status Register 
PS 64 Program Source Menory 
CB 64. — Command Buffer 
PSA . “eo EOE ram Source Rage Ses Register 
SRS . “Ee +4 Subroutine Return Stack 
SRA 16 Subroutine Return Stack. Pointer 
DPX  .38 . Data Pad X Registers (32) 
DPY 38 Data Pad Y Registers (32) 
DPBS 38 Data Pad Bus 
DPA 16 | Data Pad Address Register 
T™ ; 38 Table Memory Output Register 
TMA 16 . ' Table Memory Address Register 
MD © 38 , Data Memory Output Register 
MI . 38 Data Memory Input Register 
MA 16. : Memory Address Register | 
Al 838 Floating Adder Input Register #1 
A2 638 | Floating Adder Input Register #2 
FA 88. Floating Adder Output Register 
M1 38 Floating Multiplier Input Register #1 
M2 38 Floating Multiplier Input Register # 
FM : 33. + Floating Multiplier Output Register 
IODEVICE i I/O Device 
DA 2) Ee I/O Device Address 
INBS . 38— I/O Input Bus. 
IODRDY L,, 7 I/O Data Ready Flag 
A. Lee | I/O Device Condition "A" Flag 
B i 7 I/O Device Condition "B" Flag 


Subscripts indicate addressing within memory element, i.e. 
means the location in Program Source Memory pointed to 
te Pena Program Source Address Register. 


Superscripts indicate portions of word, i.e. A2= means the 
exponent portion of the A2 Register. : 


Parenthesis around a symbol indicates "the doutenrs of'' a register, 
i.e. (Al) means the contents of the Al Register. 
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AP-120B INTERNAL STATUS REGISTER 


13 4614 15 
Bit Reverse 


Mnemonic. 


OVF 


DIVZ 


Meaning 


Set when the current adder or nupeipiies 
(FA or FM) has overflowed. Overflow 

occurs when an exponent value is increased 
above 511. The offending result is set to 
the signed maximum of value of (1-2727) * 
251. which is roughly 6.7 * 10153. This bit 


remains on until cleared. by the microprogram 


or host computer. 


‘Set when the current adder or multiplier 


result (FA or FM) has underflowed. Under- 
flow occurs when an exponent value is 
decreased below -512. The minimum legal 
magnitude which numbers can take without 
underflowing is roughly 3.7 * 107155. The 
offending value is set to zero. This bit 
remains on until cleared bythe microprogram 


| or host a 


A deyids by zero has occurred. The result 
waS set to the value of the dividend. This 


bit remains on until cleared by the micro- 


program or host computer. 


Set when the current adder result (FA) 
is zero. 


Set when the current adder result (FA) 
is negative. . 


Set when the current S-pad function (SPFY) 


‘is zero. 


_ Set when the current S-pad function (SPFY) 
is negative. 


S-Pad carry bit. If no S-Pad shift is 
specified, carry is the carry bit from the 
S-Pad ALU. If a shift is specified, carry 


is the last bit shifted off the end of the 


S-Pad result by the shift. 


Lo 


11 


12 


13-15 Bit Reverse 


PERR 


PENB 


SRAO 


IFFT 


FFT 


(Optional). St when a Main Data Memory parity 


error has occi:red. Three parity bits are 


used, one each to check the exponent, high 


mantissa, and low mantissa portions of the memory 
word. If "PEND" is set, the processor will 


ad. ‘on this error. 


(Optional). Enables halt on memory parity error. 
If set, the processor will halt when a memory 


parity error is detected. 


Subroutine return stack overflow. Set if 
more than 16 levels of nested subroutine 
calls have occurred. 


Inverse FFT flag. When set in conjunction 
with the FFT flag, bit 12, causes roots of 
unity table references to be interpreted 
as positive angles. . | 


FFT Flag. When set causes Table Memory 
addresses to be interpreted as negative 
angles referencing the roots of unity 


table contained in Table Memory. 


15-Log,N Where N is the length of a complex 
data afray to which the S-Pad address bit- 
reverse operator is being applied. 


AP-120B FUNCTIONAL UNITS, 


HOST COMPUTER | 1/0 DEVICE a FUNCTIONAL UNIT OUTPUTS: 
et | 


1/0 DMA A 


BUS INPUTS: 


DPBS - Data Pad Bus (38) 


DPX PS - VALUE 
DPY SPFN -— zERO 


DPX - Data Pad X Output 
Seed DPY - Data Pad Y Output 


MD - Vata Memory Output 
INBS SWR TM - Table Memory Ontput 
Mi). TM FA ~ FP. Adder Output 
INBS - Input Bus (38) Formatter: FM - F.P. Multipller Output 
PNLBS ~ Panel Bus (16) - §-Pad ALU Output. 
DPA PS PSA - Direct Memory Access 
MA TMA - Vrogram Source Output 
PNLBS - Input Bus 
: d 
PROGRAM - DPBS 4 
FA SPFN 
SOURCE PNLBS 
MEMORY a ” 
- DPBS 


Write 
Index 


S-PAD ALU FUNCTIONS 
S+D 
5?D 
S41+D_. “<j! 
S-1-D I 
D+#Se-D 
D-S+D 
D AND S-D 
D OR S-bD 
D EQV S+D 


TABLE 
| MEMORY 


S-Pad 


Nees. 


Data 
Memory. 


Index . 


FA FA 


” MD a TM MD 
DPX, DPY DPX ,DPY DPX, DPY DPX, DPY S-PAD SHITTER FUNCTION 
; VALUE 3 
Floating ML | M2- Floating hoe a ae 
Point Potnt Shifter 7, JA 
Multiplier: Adder: Foe ‘ N 
M1 *2 : AltA | bas Cc 
; oe A1-A2 Integer Condition Bits 
Stage 2 A2-Al . Floating 
stone 2 | ARS(A2) Point | SPEN 
Al Fav A2 Condition 
Stage 3 Al AND A2 Bits ae 
“Al OR A2 DPBS _iData Pad Address DPA 
FIX A2 SPEN 


Wily eee cee | 
DPBS ~ P| Memory Address ban 
SP EW. 


t 
{DPBS ~ 


Table Memory Address FY TMA 


iP s 
PNEBS vPro. Saurce Add ress ) PRA 


AP-i20B Instruction Summary 
Unconditional Fields 


_ Each of the following fields may be used in any 


Octal 


48-63 of this instruction). 
are. then disabled for this instruction word. 


given instruction word. 


The YW, FM, Ml; M2, MI, TMA and DPA fields 


ol 


Octal 
Code Field Name Code 
B SOP SOP1 SH SPS . SPD FADD FADD1 Al A2 
g NOP _SOP1 . NOP NOP (S-Pad (S-Pad FADD NOP NC “NC 9 
1 & SPEC WRYEXP  L Source Dest. - FSUBR FIX “FM FA 1 
2 ADD WRTHMN RR Reg.) Reg.) FSUB FIXT DPX DPX 2 
3 SUB WRTLMN R FADD FSCLT DPY DPY 3 
4 MOV NOP ' (0-17) (0-17) FEQV FSM2C TM MD 4 
5 AND NOP FAND F2CSiM ZERO ‘ZERO 5 
6 OR NOP FOR FSCALE ZERO \MDPX 6 
7 EQV NOP To FABS ZERO EDPX 7 
10 CLR 10 
11 INC lL. 
12 DEC 12. 
13 COM 13. 
14 LDSPNL 14. 
15 LDSPE 15. 
16. LDSPI 16. 
17 LDS PT 17 
Octal Octal 
Code ' Field Name Code 
COND .. DISP DPX ‘DPY DPBS SR YR XW YW FS 
4 NOP (Branch NOP NOP ZORO - (DPX ( DPY (DPX ( DPY NOP i} 
1 # Displa~ DB DB INBS Read Read _ Write Write Fuub L 
2 BR cement) FA FA VALUE * Inde™) Index) Index) Tndex) 2: 
3 BINTRQ (0-37) FM PM - DPX 3 
4 BION DPY (0-7) (0-7)... (0-7) (0-7) 4 
5 BIOZ MD 5. 
6 BFPE SPEN 5. 
7 RETURN ™ 7 
10 BFEQ 19 
11 BENE 11. 
12 BFGE 12. 
13 BFGT 13. 
14 BEQ 14 
15 BNE 15 
16 BGE 16. 
17. BGT 16 
Octal . Octal 
Code Field Name Code 
Ml “M2 MI MA DPA EMA 
g FM FA _ NOP NOP NOP NOP Q. 
1 DPX DPX FA INCMA INCDPA INCIMA 1 
2 DPY DPY IM DECMA DECDPA DECTMA 2 
3 Tae MD DB SETMA SETDPA SETTMA m 3 
* This instruction uses a 16-bit immediate VALUE as a constant or address (in bits 


SPEC Fields 


One of the SPEC Fields may be used per * struction word. 


Octal : 
Code Field Name. 
SPEC _ STEST HOSTPNL SETPSA PSEVEN 
) STEST BFLT PNLLIT JMPA* RPSOA* 
1 HOSTPNL BLT DBELIT . JSRA* | RPS2A* 
2 SPUDA ‘BNC DBHLIT. JMP* _RPSg* 
3 NOP BZC DBLLIT — JSR* RPS2* 
4 NOP BDBN NOP Y oMPT RPSGT 
5 NOP BDBZ = NOP ~  ~JSRT RPS2T 
6 NOP BIFN . NOP /  JMPP NOP 
7 NOP BIFZ NOP - JSRP NOP 
10 SETPSA NOP SWDB NOP WPSQA* 
ED '  * PSEVEN NOP SYDBE © NOP  WPS2A* 
12 | PSODD = +NOP SWDBH NOP- WPSG * 
13 PS NOP SWDBL NOP WPS2 * 
14 SETEZSIT  BFLO NOP NOP WPSOT 
15 _ NOP _ BFLI “NOP NOP VPS2T 
16 NOP BFL2 NOP » NOP NOP 


17. NOP BFL3 — NOP NOP NOP 


I/O Fields 


The S-PAD Fields (D, SOP, 
SOP1, SH, SPS; and SPD) are then disabled for this instruction. 


PSODD_ 


RPS1A* 
RPS3A* 
RPS1* 
RPS 3* 
RPSIT 
RPS3T 
NOP 
NOP 
WPS 1LA* 
WPS3.4* 


» WPod* 


WPS3 * 
WPS1T 
WPS3T 
NOP 
NOP 


PS 


RPSLA* 
RPS FAt 
RPSL? : 
RPSF* 
RPSLT 
RPSFT 
RPSLP 
RPSFP 
LPSLA* 
LPS RA* 
LPSL* 
LPSR* 
LPSLT 
LPS&RT 
LPSLP 
LPSRP 


Octal 
Code 
. SETEXIT 
NOP g 
SETEXA* 1 
NOP 2 
SETEX* 3 
NOP 4 
SETEXT 5 
NOP 6 
SETEXP 7 
NOP ; 10 
NOP Ll 
NOP 12 
MOP 13 
NOP 14 
NOP 15 
NOP 16 
NOP 17. 


One of the I/O fields maybe used per instruction word. The Floating Adder Fields 
(FADD, FADD1, Al, and A2)-are then disabled for this instruction word. 


Octal 
Code ' Field Names 
IO LDREG RDREG INOUT SENSE 

g LDREG - NOP RPSA OUT * SNSA 

1 RDREG LDSPD RS PD ' SPNCUT SPININ 
2 SPMDAV LDMA RMA OUTDA SNSADA 
3 NOP LDT*IA RTMA SPOTDA  SPNADA 
4 INOUT LDDPA . RDPA IN SNSB 

5 SENSE _ UDSP RSPIN SPININ.° SPING 
6 FLAG LDAPS RAPS OUTDA . SNSBDA 
qT CONTROL LDDA . RDA © SPINDA SPNBDA 


FLAG 


SFL2 
SFL1 
SFL2 
SFL3 
CFLY9 
CFLIL 
Crh. 
CFL3 


CONTROL 
HALT 


, LORST 


INTEN 
INTA 
REFR 
WRIEX 
WRIMAN 
NOP ~ 


Octal 
Code 


NQNOau wk wo Oro 


* This instruction uses a 16- bit integer VALUE (in cits 48-83 of the instruction word), 
The YW, Tul, Ml, M3, MI WA, THA, and DPA Fields are then disabled for this instruction 


AP-1208 Instruction Field Cayout 


. . ' . oe . . : 3 
rn 
t os a, Rt caiahe Sia? -aeeh ya nen tee ' , ra aiciy ol 

Sob 2 3 4 § 8 7 8 9 We Ll 22 14 14 45 15 Cr os les t 
[Ae a Pa ce DP GO RES MEL AEE OE a Tne ea eS SSE TNE ES 
: ' " : pes : 4 4 ‘ 
| Bleep en | $P9 | aso A - gas : 
! | So {32 | srg 3 i rAS> ¢t » ; 
a a aR ORR ERR REE ERR eee 
iy 
| 3-Pad Sroup | Addar Group ‘ | 


* 4 
t 


d H 
; : 
2 oz es . “fe -* e; sftietn. st z z. 4% 2 : 
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—_ - POPSET ete Se URE tae pe Gt Toate the tee aN 
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AP-120B Instruction Field Layout 


5 6 7 8 9 10 11 12 13)14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 


Adder Group Branch Group 


SPEC OPER 


32 33 34 35 36 37 38139 40 41 42 43 44 45 46 47 48 49 50 56 57 58 59 60 61 62 63 


br CS Oe a ate ee 


Data Pad Group — | Multiply Group|Memory Group 


VALUE 


:@ XICNAddv. 


AUVWANS NOLLONULSNI 


| INTENTIONALLY BLANK 


S-PA:: GROUP 


one 3/4 516.  £9]f10 43 
rs SOP 1 SE 5 
Eo? aaa | 


| Octal : 
Field . Code Mnemonic Effect 
BO Og a No-Op 
2. | & ; Use SPgpg (bit-reversed) 
soP 0 9 Le See SOP1 field 
| 1 ae ~ See Special Operations Group 
D ADD  (SPgpp)+(SPgpg) *SPEN 
3 _ *SUBY ~ “> (SPgpp)~(SPgpg) *SPFN 
4 MOV §——« (SPgpg) *SPFN 
5 . AND -. (SPgpp) AND(SPgpg)*SPFN 
6 OR  _— (SPgpp) OR(SPgpg)*SPFN 
7 QV (SP gpp JEOR(SPgpg)>SPFN 
*SH iy = No-Op 
1 L SPFN*2+SPFN (left shift) 
a RR SPFN:45SPFN (double right shift) 
3 Roo SPFN+2+SPFN (right shift) 
SPS | 0-178 0-172 S-Pad Source Operand Address' 
SPD 0-17, “0-17, S-Pad Destination Address, .SPEN>°"spp 
’ unless inhibited by No Load (COND=1) 
*wote: “hese are logical shifts: 
“Right shift o +7 das -[d 


Left shift [c¢]l+{_o-is "ds «+6 


Octal 


Field _Code _ Mnemonic Effect 
SOP1 o 2° = No-Op 
1 WRTEXP Restricts DPX, DPY & MI fields to Write 
| Exponent Only 
a | WRTHMN Restricts DPX, DPY & MI fields to Write 
. High Mantissa Only (Bits OO-11) 
3 WRTLMN —- Restricts DPX, DPY & MI fields to Write 
: | Low Mantissa Only (Bits 12-27) 
4 E : 
5 : 3 
6. - 
7. as - 
10. CLR Q+SPFN 
11 INC. (SPgpp)+1+SPFN 
12 DEC (SPgpp)-1+SPFN 
i com  —S—=—Ss (Sgpp) *SPFN 
14  LDSPNL SPgpp + SPFN;~-PNLBS SP aes 
i LDSPE SPSPD + SPFN, DPBSE - 512 + SPgpp 
16 _ | LDSPI -SPgpp 3 SPFN, ppBS“Ls spo, 
oe — LpspT aes SPEN, DPBS"? SPspp 


MH= =Mantissa sete Mantissa bits O0O-11 

ML=Mantissa Low=Mantissa bits 12-27 

MT=Mantissa bits for table jookups= =Mantissa bits 02-08 
E=Exponent 
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SPECIAL OPERATIONS GROUP 


sere 
| _ Octal . i 
Field | Code Mnemonic Effect 
SPEC $6 = « _ See STEST Field (B-6) 
1 -_ See HOSTPNL Field (B-7) 
2 SPMDA .—s—ié«i<ws Sn until MD available 
3 | a oe 
A. > e = 
5 : 2 
6 | 2 . 
ine ‘a See SETPSA Field, inhibit TEST 
| : . except No Load (B-8) 
1 : = See PSEVEN Field (B-9) 
12 - See PSODD Field (B-10) 
SRS: 2 ke See PS Field (B-11) | 
i . See SETEXIT Field (2-12) 
15 . 
16 - - 
17 - - 


Field | Code ions Effect - 
STEST 9 © *BFLT ‘Branch if FA<9.9 
| 1 BLT Branch if SPFN<Q- 
2 BNC Branch if S-Pad carry bit=l 
3 BZC Branch if S-Pad carry bit=9. 
oe BDBN Branch if DPBS<g.9 
5 BDBZ = Branch if DPBS positive and unnormalized. 
6 BIFN Branch if Inverse FFT flag=1 | 
7 BIFZ Branch if Inverse FFT flag=0 
10 a fas | 
ae re a 
12 z2 2s 
13 -- _~ 
be | BFLO@ | Beanen if Flag ¢=1 
15 BFL Branch if Flag 1=1 
16 BFL2 — Branch if Flag 261 
4 BFL3 7 Branch if Flag 3=1 


If the above specified condition is true OR the condition specified 
in the COND field is true, a branch occurs to (PSA)+DISP-20 


Field Code __Mnemoni- Effect — 
HOSTPNL 9 | PNLLIT -PNLBS +LITES 
og DBELIT DPBS=+PNLBS+LITES 
ie. DBHLIT —+-vpBSMHpNLBS+LITES 
3 DBLLIT -DPBSMLpNLBS+LITES | 
ae i a 
5 “ _ 
6 = Z 
7 2 7 
10 - SWDB (SWR) +PNLBS+DPBS 
oe  SWDBE (SWR) +PNLBS+DPBS= and WRTEXP* 
120” ‘SWDBH | (SWR) +PNLBS+DPBS“# and WRTHMAN*: 
1300” SWDBL (SWR) +PNLBS+DPBSML. and WRTLMAN*: 
7 | . “a 
15 er a 
16 S 7 
17 - ve, = 


*Restrict DPS, DPY and MI to: 


WRTEXP: Write Exponent only 
WRTHMAN: Write High Mantissa 
only (bits 00-11) 
WRTLMAN: Write Low Mantissa 

only (bits 12-27) 


MH=Mantissa High=Mantissa bits 00-11 


ML=Mantissa Low =Mantissa bits 12-27 
E=Exponent 


Be? 


Field Code Mnemonic _Efiect_ 


SETPSA 9 IMPA _ -VALUE+PSA 

1 | JSRA (SRA)+1+SRA, (PSA)+1+SRSgpa, VALUE+PSA 
2 IMP _ VALUE+(PSA)+PSA 
3 «JSR (SRA)+1+SRA, (PSA)+1+SRSgpa, “VALUE 

| | | +(PSA)+PSA 

4 IMPT (TMA)+PSA 

<5 JSRT  (SRA)+1+SRA, (PSA)+1+SRSgpa, (TMA)>PSA 
6 JMPP (SWR)>PNLBS>PSA 
7 JSRP | (SRA)+1+SRA, (PSA)+1+SRSgpq, 

| (SWR)*PNLES~PSA 


VALUE=Bits 48-63 of this instruction (CB48-CB63) 


17 


Mnemonic 


RPSQA 
RPS2A 
RPSG 
RPS2 
-RPSQT 


Effect 
(PS2% )+PNLBS+ LITES 
- VALUE 


(PS oe om PNLBS LITES 


Q@ > 
(PS etue +psa) PNLBS LITES 
Q2 , 
(PS Tanugspsa ) “PNLBS* LITES 
(PS24 Ss») PNLBS+ LITES 
“PMA 
(PS22 =) + PNLBS+ LITES 
| TMA 7 | | 
(SWR)  -+PNLBS+ps2? | 
VALUE 
(SWR) | +PNLBS+Ps 2? 
"VALUE 
| | fo) 
(SWR) *PNLBS+PS en 
(SWR) . +PNLBS+Ps 2? 
| VALUE+PSA 
(SWR) | +PNLBS+ps 9% 
TMA 
(SWR)  +PNLBS+Ps 2? 
TMA 


This field requires 2 cycles to’ execute 


VALUE = Bits 48-63 of this instruction (CB48-CB63) 
4% = Quarter zero of Program Source Word (PS@@-PS15) 
Q2 = Quarter two of Program Source Word (PS31-PS47) 


Field Code __——s Mnemonic 
PSODD  ~—=—_C «9 RPS1A | 
ee RPS3A 
2 RPS1 
3 RPS3 
4 RPS1T 
5 RPS3T 
6 Ss. 
7 7 
10  WPS1A 
aie  WPS3A 
12 WPS1 
13. WPS3 
14 WPSIT. 
as: - WPS3T 
16 : 
i7 = 
ane eee 


(PS 


Effect. 


Ql 2 7PNLBS+' LITES 
VAL 

(PS23 _)+PNLBS+ LITES 
VALUE ; 


(ps Qt )+PNLBS+ LITES 


VALUE+PSA 


Q3 3 | 
(PS vatue+psa )7PNLBS>+ LITES 


(PSS )+PNLBS+ LITES 


(PS23 )+PNLBS+ LITES 


| Qu 
((SWR) >PNLBS+PS¢. 


) Q3 
(SWR) >+PNLBS+PS Sarde 


Ql 
(SWE). OPNLBS RS VALUE+PSA 


res 
(SWR) .*PNLBS*+PS 7 uB+PSA 
, Ql 
(SWR) +PNLBS+PS© 


03 
(SWR) +PNLBS+PS?? 


This field requires 2 cycles to execute. 


VALUE=Bits 48-63 of this instruction (CB48-CB63) 
Ql=Quarter one of Program Source Word (PS16-PS31) 
Q3=Quarter three of Program. Source Word (PS48-PS63) 


Field 
PS 


17 


Mnemonic 


RPSLA | 


RPSFA 


RPSL 


RPSF 


~ RPSLT 


RPSFT 
RPSLP 


RPSFP . 


LPSLA 
LPSRA 
LPSL 
LPSR 
LPSLT 
LPSRT 
LPSLP 
LP SRP 


A IE GTC A SEAS fee rr st ry tnnesrutt—met 


(PS 


Effect 


lf 
(PS varur+psa )7DPBS_ 


g FP 
~ VALUE+PSA 


LH 
TMA )+DPBS 


(P 


(PST )+DPBS 


RH 
DPBS+PS VALUE 


a: 
DPBS>PS vaTuE+PSA 


RH 
DPBS+PS VALUE+BA 


LH 
DPBS+PS waa 


- Se 
_ DPBS+PS fin 


ppBs-+ps 
. PNLBS 


| RH 
DPBS+PS pnts 


This field requires 2 cycles to execute. 


-VALUE=Bits 48-63 of this instruction (CB48-CB63) 
LH=Left half of Program Source Word (Bits 00-31) 
RH=Right half of Program Source Word (Bits 32-63) 


FP=Program Source bits 26-63, used for floating-point literals 


Field Code Mnemonic Effect 


SETEXIT 9 | ao = 


7 _ SETEXA | VALUE+SRSgpq_ 

2 a ae 

3 SETEX VALUE+(PSA)>SRSopq 
; ; _—— : . 
5 ss SETEXT . TMA*SRSopq 

a 4) (Ue = 

7 | SETEXP PSAtL +SRSop 


Sets the current subroutine petarn address as indicated Spove. 
SRA does not change. 
- VALUE=Bits. 48-63 of this PaSerucrion.: 


Octal 
Field Code. -Mnemonic 
FADD © Q ~ 
| . 2 FSUBR 
2 FSUB 
3 | FADD 
4 : FEQV 
5 FAND 
6 FOR 
wg = 
Al Q NC. 
an FM 
2 DPX(1DxX) 
3 DPY(1DX) 
4 TM 
5 ZERO 
: : | 
7 = 


FLOATING ADDER GROUP 


14 16[17 19|20 22 
FADD Al 
FADD1 


Effect 


‘See FADD1 field 


Subtract: (A2) - (Al) 


Subtract: (Al) - (A2) 


Logical Equivalence: (Al) 


Logical and: 


Lozical or: 


See I/O Group 


(Al1)+Al 


FM>A1 


Add: (AL) + (A2) 


SOR (A2) 


(Al) AND (A2) 


(Al) OR (A2) 


(DPXppa+ipx )*Al where XR=1DX+4 


(DPYppa+1pDx )*Al Where YR=1DX+4 


(TM)+Al 
G.9> Al 


Note: ei Pres ne adder op-codes: 


renee exponents 


Perform the specified arithmetic, logical, 
operation : 


Normalize 


Convergently round 


or shift 


: Octal . 
Field Code “Mnemonic Bffect 


AQ Qo NC. (A2)+ a2. 

4 FA PA+A2 

2 DPX (1DX) {PPE ppat LDX )+A2, Where XR=1DX+4 
3 DPY (1DX) ( DPY Dae 1DX )+A2, Where YR=1DX+4 
4 MD (MD)+A2 

5 ZERO G.G>A2 | 

| M 

G MDPX (1DX) SPFN+512+a2") (DPE ppa +1Dx )+a2”™! 
7 -EDPX (1DX)  (DPx® M 


DPA+ 1DX )+a2" »SPEN +A2 (99-91), 


g a2" (92-27) 


FADD1 a, 7 _ No-Op 
a _ FIX Convert (A2) to an integer 
2 . FIXT Convert (A2) to an integer (result 


truncated ) 


3 FSCLT Snift (AZ) rignt and increment ag® 
"3 until A2> =(SPFN+511) (result trun- 
cated). 
a FSN2C _ Convert (42), from signed Magnitude 


to 2's complement. 


3 _ F2CSM Convert (A2) from 2's complement 
to signed magnitude. 
’ Tv 
FSCALE Shift @2)right and increment A2” 
until Aga= SPFN+S11. 


oO) 


Z  FABS © Take the absolute value of (A2). 


Field 
r/O 


LDREG 


NOD on nes Ww bo 


I/O GRUUP 


Mnemonic Effect 
_ See LDREG field 
- See. RDREG field 


SPMDAV Spin until MD available 


i.e See INOUT field 
: See SENSE field 
= See FLAG field 


o See CONTROL field 


- No-Op 

LDSPD | DPBS+SPD 

LDMA - DPBS+MA 

LDTMA ==—s«dDPBS+TMA 

LDDPA DPBS+DPA 
-EDSP ~ | _SPgpp*SPEN, DPBS+SP,0, 
-LDAPS .  —“~DPBS+APSTATUS, 


-LDDA : - DPBS+DA 


7 — Octal | 
Field Code Mnemonic Effect 


RDREG g —RPSA (PSA)+PNLBS 
| 1 RSPD — -(SPD)+PNLBS 
2 RMA (MA)+PNLBS 
3 RTMA (TMA )+PNLBS 
4 BREA (DPA)>PNLBS 
5 RSPEN SPFN+PNLBS 
6 RAPS (APSTATUS )>PNLBS 
7 RDA (DA)*+PNLBS 
INOUT g OUT DPBS*IODEVICEp a 
1 - SPNOUT SPIN if IODRDYp,=9 
: DPBS+IODEVICE 
| DA 
2 OUTDA DPBS+IODEVICE,,, SPFN+DA 
3 | SPIN if IODRDY.,=9, SPFN+DA 
| Spoeee DPBS+IODEVICE pa | 
4 EN ( IODEVICE,, , )+INBS 
5 SPININ SPIN if IODRDYpa=90 
. (IODEVICE,, )+INBS 
| DA 
6 “INDA: (IODEVICE,,)+INBS, SPFN+DA 
t SPINDA SPIN if IODRDYp,=%, SPFN+DA 


ae INBS 
-(IODEVICE,, ,)>IN 


I Oo oa B w 


Mnemonic 
SNSA_ 
SPINA 


SNSADA 


- SPNADA 


SNSB 


SPINB 


SNSBDA 


SPNBDA 


Effect _ 


ADA*IODRDY Flag - 


Apa*IODRDY, 
Apa*IODRDY, 
Apa>IODRDY, 
Bp a7 IODRDY 
Bpa~IODRDY, 
Bpa+IODRDY, 


Bpa~IODRDY, 


Se IN if 


SPFN+DA 


‘SPIN -if 


Flag 
SPIN if 
SPFN+LA 


SPIN if 


IODRDY=9. 
IODRDY=9, SPFN+DA 
IODRDY=9 


IODRDY=9, SPFN+DA 


A and B are I/O device dependent condtions, either 1 or @ 


FLAG 


D 
1 


NY SG Oo BP WW bb 


SFL@ 
SFL1 


SFL2 


SFL3 


CFLY 
CFL1 
CFL2 
CFL3 


1+FLAGg 


1+FLAG} 


1+FLAG3, 


9+FLAGS 


9+FLAG, 


Field 


CONTROL 


Mnemonic. 


HALT | 


IORST 


INTA 


REFR 


WRTEX 


WRTMAN | 


Effect 
Halt 
I/O reset 


Interrupt acknowledge. Device 
Address of interrupting device 
put onto DPBS. 


Memory refresh sync 


Restricts DPX, DPY & MI to 


Write exponent only 


Restricts DPX, DPY & MI to 
Write Mantissa Only (Bits 0-27) 


BRANCH GROUP 


23 26 |27 7 34 


DISP 
| Octal | | . 
Field Code | Mnemonic Effect 
COND Q 2. ‘No-Op 
mo. 3. ae , Inhibit load of SPFN~SPspp 
2 ' BR . Branch always 
3 BINTRQ | Branch if INTRO (Interrupt Request) 
| flag=1 
4 ; BION _ Branch if IODRDY,, flag=1 
5 +  — BIOZ Branch if IODRDY,, flag=0 
6 ae BFPE | peanehy ch floating-point arith- . 
: a 2 metic error (overflow, underflow, | 
or divide by zero) - 
o RETURN (SRSgpq)>PSA, (SRA)-1+SRA (Sub- 
. routine return jump. 
NOTE: "RETURNS" may not be made in two successive instructions. 
Re BFEQ Branch if FA=9.9._ 
1 + BFNE Branch if FA#0.9 
iz. BFGE Branch if FA>9%.@ 
1300CtC~«:*W BFGT Branch if FA>@.@ 
14 BEQ -—- Branch if SPFN=9 
i BNE Branch if SPFNA9_ 
16  BGE_ Branch if SPFN>9 
| 17. BGT Branch if SPFN>9 
Note: FA and SPFN are tested as to their Se eee: for the previous 
8 instruction. 
DISP ty) con ane ff pranch condition is true, (PSA) | 


+DISP-20+PSA 


Thus the effective Branch panes is -20 to +17 relative to the 
curren instruction. 


B-19 


DATA PA! GROUP 


32 33(34 35[36 38/39 41f42 44fas 47 [48 50 
DPX DPBS XR | 


Octal 
Field Code Mnemonic 
DPX ) = 
1. = ——sdDPX(1DX) <DB 
a. - DPX(1DX) <FA 
23 -- DPX(1DK) <FM 
DPY 9 7 
at DPY(1DX)<DB 
2 | ‘DPY(1DX)<FA 
an DPY(1DX)<FM 


Effect 


_No-Op 


DPBS~+*DPXppa+1DX, Where XW=1DX+4 
FA>*DPXppA+1DX, Where XW=1DX+4 


FM>*DPXppx+1DX, . Where XW=1DX+4 


No-Op 


-DPBS+*DPYppa+ipxX Where YW=1DX+4 


FA>*DPYppa+1DX Where YW=1DX+4 


FM>*DPYppa+1pDX Where YW=1DX+4 


*AJ1 bits written unless WRTEXP, WRTHMAN or WRTLMAN set. See SOP1 


and HOSTPNL fiéld. 


DPBS Gg DB=ZERO 
1 _ DB=INBS 
2 - DB=VALUE 
3 - DB=DPX(1DX) 
4 DB=DPY(1DX) 
5 DB=MD 
6 DB=SPFN 
7 DB=TM 


Q.9+DPBS 
INBS+DPBS 


VALUE+DPBS=, VALUE+DPBSML, 
Sign extended into PE ie 


(DPX *DPBS, Where XR=1DX+t4 


DPA + 1DX) 
YR=1DX+ 
(DPYp pa + 1px)7DPBS, Where 1DX+4 


(MD) DPBS © 


SPFN + 512*DPBSE, SPFN+DPBSML , 
Sign extended into DPBS), 


(TM)+DpBs " 


DPBS forced to 9 if HOSTPNL field=10 to 13 


ML=Mantissa Low (Mantissa Bits 12-27) 
MH=Mantissa High (Mantissa Bits 00-11) 


E=Exponent 


VALUE is a 16-bit 2's complement number, 


the instruction word. 


contained in bits 48-63 of 


‘Octal 


Field Code_———s Mnemonic _Effect 

XR @ to 7 DPX Read EFA is (DPA)+XR=4 
YR to 7 DPY Read EFA is (DPA)+YR-4 
XW | 9 to 7 DPX Write EFA is (DPA)+XW-4 
yw ® to 7 DPY Write EFA is (DPA)+YW-4, 


YW=XW if VALUE is used in 
another field 


B=-21 


_ FLOATING MULTIPLIER GROUP 


51152 53154 a 


/FM| M1 M2 
Octal | 
Field Code Mnemonic. Effect 
FM g - No-Op _ 
| al _ FMUL Multiply: (M1)*(M2) 
M1 Q FM | FM>M1 
1 DPX (IDX) (DPXppa+ ipx)*M1l, where xR=1Dx+4 


2 DPY (1DX) (DPYppa+ 1px )*Ml, where YR=1Dx+4 
30 TM. , (TM)>M1 | 


M20 p | FA FA+M2 
| LL DPX (IDX) = (DPXppa+ 1px )*M2, Where XR=1DX+4 
2 DPY (ADK) = CDPY pas 1Dx )>M2, Where YR=1DX+4 

a.) * . ie 2s x (MD) +612 | | | 


Note: These fields are not in effect if VALUE is used in 
7 another field. 


Arguments that are unnormalized by more than one noetrR will 
produce incorrect results. 


 -B-22 


MEMORY GROUP 


156 57158 39/60 61/62 63 
LMI | MA [ DPA TT Tmaé | 


Octal 


Field Code Mnemonic Effect 
MI g , —_ ae No-Op 
1 - MI<PA FA*MI, write MI into Data Memory** 
oO . MI<FM FM*MI, write MI into Data Memory** 
3 MI <DB DPBS+MI, write MI into Data Memory** 


*KATL bits written gatese WRTEXP, WRTHMAN or WRTLMAN is set. 
See SOP1 and HOSTPNL fields. 


tt 


MA go = | No-Op 


as on INCMA (MA)+1°MA, intitate a Data Memory _ 
cycle ; ee 
2 DECMA . (MA)-1°MA, eae a Data Memory . 
cycle ae 
| 3 SETMA *“SDFN-MA, initiate a Data Memory cyelé 
date is used in place of SPFN if LDREG field is used. 
DPA g é - -No-Op 
1 INCDPA Ss (DPA) +1+DPA 
2 DECDPA - (DPA)~1+DPA 
3 SETDPA *SPFN+DPA 


*DPBS is used in gees of SPFN if LDREG field is used. 

Note: These fields are not in effect if a value is used by 
another field. Changes made in MA, TMA, or DPA do not 
affect the values of these registers used by other 
fields re the current instruction. 


Code _ Mnemonic 
~ = 

1 INCTMA 

2 DECTMA 

3 _ SETTMA 


Effect 


No-Op 


‘((TMA)+15TMA, initiate a read from 


Table Memory 


(TMA)+1+TMA, initiate a read from 


Table Memory 


*SPFN+TMA, initiate a read from 


Table Memory 


LS tA Se As pr Se Se SPs ly sno sSnioh st eStnSGVSSSshebsurnnshs 


*DPBS is used in place of SPFN if LDREG field is used. 


Note: 


These fields are not in effect if a VALUE is used by 
another field. Changes made in MA, TMA, or DPA do not 
affect the values of these registers used by other fields 


‘during the current instruction. 


AP-120B Instruction Field Layout 


‘0 1 2.3 4 3 6 ei 8 9 10 11 ~=612~«6:13{14 #15 16 #17 ~=#«+18 «19 20 21 22/23 24 #25 26 27° 28 #29 #30 31 
H SPS - 


_ S-Pad Group ; . Adder Group Branch Group | 


~ SOP] . 


SPEC OPER 


32. 33 «334 #35 #36 #37 «38{ 39 40 41 42 43 #44 «#45 #46 «47 48 49 50[/5]1 52 53 54 55 56 57 58 59 60 61 62 63 


DPX DPY DPBS __ XR XW YW FM | ML M2 MI DPA. | TMA 


VALUE 


| FLOATING. POINT SYSTEMS, INC. . 
PO, BOX 23489 PORTLAND, OR97223 OOO SW. IITH STREET, BEAVERTON, OR97005 (503) 6413151 TLX:360470 FLOATPOINT PT 


